THERMOSTABLE LUCIFERASES AND METHODS OF PRODUCTION



The government may have rights to this invention based on support
provided by NIH 1R43 GM506 23-01 and 2R44 GM506 23-02 and NSF ISI-9160613
and III-9301865.

RELATED APPLICATIONS 

This application claims priority from copending U.S. Ser. No. 60/059,379,
filed September 19, 1997.

FIELD OF THE INVENTION 

The invention is directed to mutant luciferase enzymes having greatly
increased thermostability compared to natural luciferases or to the luciferases
from which they are derived, as measured, e.g., by half-lives of at least 2 hrs.
at 50°C in aqueous solution. The invention is also drawn to polynucleotides
encoding the novel luciferases, and to hosts transformed to express the
luciferases. The invention is further drawn to methods of producing luciferases
with increased thermostability and to the use of these luciferases in any method
in which previously known luciferases are conventionally employed. Some of the
uses employ kits.

BACKGROUND OF THE INVENTION

Luciferases are defined by their ability to produce luminescence. Beetle 
luciferases form a distinct class with unique evolutionary origins and chemical 
mechanisms. (Wood, 1995) 

Although the enzymes known as beetle luciferases are widely recognized
for their use in highly sensitive luminescent assays, their general utility has
been limited due to low thermostability. Beetle luciferases having amino acid
sequences encoded by cDNA sequences cloned from luminous beetles are not
stable even at moderate temperatures. For example, even the most stable of the
luciferases, LucPpe2, obtained from a firefly, has very little stability at the
moderate temperature of 37°C. Firefly luciferases are a sub-group of the beetle
luciferases. Historically, the term "firefly luciferase" referred to the enzyme
LucPpy from a single species, Photinus pyralis (Luc+ is a version).

Attempts have been reported to mutate natural cDNA sequences encoding
luciferase and to select mutants for improved thermostability (White et al.,
1994, from P. pyralis, and Kajiyama and Nakano, 1993, from Luciola lateralis).
However, there is still a need to improve the characteristics and versatility of
this important class of enzymes.

SUMMARY OF THE INVENTION 

The invention is drawn to novel and remarkably thermostable luciferases,
including luciferases with half-lives of at least 2 hrs. at 50°C or at least
5 hrs. at 50°C in aqueous solution. The mutant luciferases of the present
invention display remarkable and heretofore unrealized thermostability at room
temperature (22°C) and at temperatures at least as high as 65°C. The invention
is further directed to the mutant luciferase genes (cDNA) which encode the novel
luciferase enzymes. The terminology used herein is as follows: e.g., for the
mutants isolated in experiment 90, plate number 1, well B5, the E. coli strain
is 90-1B5, the mutant gene is luc90-1B5, and the mutated luciferase is
Luc90-1B5.

By thermostability is meant herein the rate of loss of enzyme activity,
expressed as a half-life, for an enzyme in solution at a stated temperature.
Preferably, for beetle luciferases, enzyme activity means luminescence measured
at room temperature under conditions of saturation with luciferin and ATP.
Thermostability is defined in terms of the half-life (the time over which 50% of
the activity is lost).

The invention further encompasses expression vectors and other genetic
constructs containing the mutant luciferases, as well as hosts, bacterial and
otherwise, transformed to express the mutant luciferases. The invention is also
drawn to compositions and kits which contain the novel luciferases, and to the
use of these luciferases in any methodology where luciferases are conventionally
employed.

Various means of random mutagenesis were applied to a luciferase gene
(nucleotide sequence), most particularly gene synthesis using an error-prone
polymerase, to create libraries of modified luciferase genes. Each library was
expressed in colonies of E. coli and visually screened for efficient
luminescence to select a subset library of modified luciferases. Lysates of
these E. coli strains were then made, and quantitatively measured for luciferase
activity and stability. From this, a smaller subset of modified luciferases was
chosen, and the selected mutations were combined to make composite modified
luciferases. New libraries were made from the composite modified luciferases by
random mutagenesis and the process was repeated. The luciferases with the best
overall performance were selected after several cycles of this process.

Methods of producing improved luciferases include directed evolution
using a polynucleotide sequence encoding a first beetle luciferase as a starting
(parent) sequence, to produce a polynucleotide sequence encoding a second
luciferase with increased thermostability, compared to the first luciferase,
while maintaining other characteristics of the enzymes. A cDNA designated
lucppe2 encodes a firefly luciferase derived from Photuris pennsylvanica that
displays increased thermostability as compared to the widely utilized luciferase
designated LucPpy from Photinus pyralis. The cDNA encoding LucPpe2 luciferase
was isolated, sequenced and cloned (see Leach et al., 1997). A mutant of this
gene encodes a first luciferase, LucPpe2 [T249M].

In an embodiment of a mutant luciferase, the amino acid sequence is that of
LucPpe2 shown in FIG. 45 with the exception that at residue 249 there is an M
(designated T249M) rather than the T reported by Leach et al. The bold,
underlined residue (249) shows the mutation from T to M. This enzyme produced
approximately 5-fold more light in vivo when expressed in E. coli.
Double-underlined residues were randomized by oligonucleotide mutagenesis.

Diluted extracts of recombinant E. coli that expressed mutant luciferases
made by the methods of the invention were simultaneously screened for a
plurality of characteristics including light intensity, signal stability,
substrate utilization (Km), and thermostability. A fully automated robotic
system was used to screen large numbers of mutants in each generation of the
evolution. After several cycles of mutagenesis and screening, thereby creating
mutant libraries of luciferases, an increased thermostability compared to
LucPpe2 [T249M] of about 35°C was achieved for the most stable clone [clone
Luc90-1B5], which also essentially maintained thermostability (there was only a
negligible loss in activity of 5%) when kept in aqueous solution over 2 hrs. at
50°C, 5 hours at 65°C, or over 6 weeks at 22°C.

Mutant luciferases of the present invention display increased
thermostability for at least 2 hrs. at 50°C, preferably at least 5 hrs. at 50°C,
in the range of 2-24 hrs. at 50°-65°C. In particular, the present invention
comprises thermostable mutant luciferases which, when solubilized in a suitable
aqueous solution, have a stability half-life greater than about 2 hours at about
50°C, more preferably greater than about 10 hours at 50°C, and more preferably
still greater than 5 hours at 50°C. The present invention also comprises mutant
luciferases which, when solubilized in a suitable aqueous solution, have a
stability half-life greater than about 5 hours at about 60°C, more preferably
greater than about 10 hours at about 60°C, and more preferably still greater
than about 24 hours at about 60°C. The present invention further comprises
mutant luciferases which, when solubilized in a suitable aqueous solution, have
a stability half-life greater than about 3 months at about 22°C, and more
preferably a half-life stability of at least 6 months at 22°C. An embodiment of
the invention is a luciferase mutant having stability for 6 hours at 65°C
(equivalent to a half-life of 2 days). A loss of activity of about 5-6% was
found. The half-lives of enzymes from the most stable clones of the present
invention, extrapolated from data showing small relative changes, are 2 days at
65°C (corresponding to 6% loss over 6 hours), and 2 years at 22°C
(corresponding to 5% loss over 6 weeks).
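The extrapolation from a small activity loss to a half-life can be sketched as follows, assuming simple first-order (exponential) inactivation. The source does not state the kinetic model it used, and the function name `half_life_from_loss` is illustrative only. Under that assumption, 6% loss over 6 hours and 5% loss over 6 weeks give half-lives of the same order as the approximately 2 days and 2 years quoted above.

```python
import math

def half_life_from_loss(fraction_lost, elapsed):
    """Half-life (same units as `elapsed`), assuming first-order decay.

    A(t) = A0 * exp(-k t)  =>  k = -ln(1 - fraction_lost) / elapsed
    t_half = ln(2) / k
    """
    if not 0.0 < fraction_lost < 1.0:
        raise ValueError("fraction_lost must be between 0 and 1")
    k = -math.log(1.0 - fraction_lost) / elapsed
    return math.log(2.0) / k

# 6% loss over 6 hours at 65°C, converted from hours to days
days_at_65 = half_life_from_loss(0.06, 6.0) / 24.0
# 5% loss over 6 weeks at 22°C, converted from weeks to years (52 weeks/year)
years_at_22 = half_life_from_loss(0.05, 6.0) / 52.0
print(f"65°C: about {days_at_65:.1f} days; 22°C: about {years_at_22:.1f} years")
```

Because the measured losses are small, the result is sensitive to assay noise, so the computed values should be read as order-of-magnitude estimates.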

In particular, the invention comprises luciferase enzymes with
embodiments of amino acid sequences disclosed herein (e.g., mutant luciferases
designated Luc49-7C6, Luc78-0B10, and Luc90-1B5; FIGS. 27, 36, 43), as well as
all other beetle luciferases that have thermostability as measured in half-lives
of at least 2 hours at 50°C. The invention also comprises mutated polynucleotide
sequences encoding luciferase enzymes containing any single mutation or any
combination of mutations of the type and positions in a consensus region of
beetle luciferase encoding sequences, disclosed herein, or the equivalents. The
mutations are indicated in the sequences in FIGS. 22-47 by bold, underlined
residues and are aligned with other beetle luciferase sequences in FIG. 19.

Nucleotide sequences encoding beetle luciferases are aligned in FIG. 19.
Eleven sequences found in nature in various genera and species within genera are
aligned, including lucppe-2. Nucleotide sequences encoding three mutant
luciferases of the present invention (Luc49-7C6, 78-0B10, 90-1B5) are also
aligned. There are at least three mutations in each mutant luciferase that shows
increased thermostability. In general, mutations are not in the conserved
regions. Conserved amino acids are those that are identical in all natural
species at positions shown in FIG. 19. Consensus refers to the same amino acid
occurring in more than 50% of the sequences shown in FIG. 19, excluding LucPpe2.

DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to beetle luciferases that are characterized by high
thermostability and are created by mutations made in the encoding genes,
generally by recursive mutagenesis. The improved thermostability allows storage
of luciferases without altering their activity, and improves reproducibility and
accuracy of assays using the new luciferases. The invention further comprises
isolated polynucleotide sequences (cDNAs) which encode the mutant luciferases
with increased thermostability, vectors containing the polynucleotide sequences,
and hosts transformed to express the polynucleotide sequences. Table 1 shows
results for about 250 clones and characteristics of the luciferases from the
clones, including thermostability. The invention also encompasses the use of the
mutant luciferases in any application where luciferases are conventionally
utilized, and kits useful for some of the applications.

Unexpectedly, beetle luciferases with the sought-after high thermostability
were achieved in the present invention through a process of recursive
mutagenesis and selection (sometimes referred to as "directed evolution"). A
strategy of recursive mutagenesis and selection is an aspect of the present
invention, in particular the use of multi-parameter automated screens. Thus,
instead of screening for only a single attribute such as thermostability,
simultaneous screening was done for additional characteristics of enzyme
activity and efficiency. By this method, one property is less likely to "evolve"
at the expense of another, which could result, for example, in increased
thermostability but decreased activity.

Table 1 presents examples of parameter values (Li, Tau, Km and S) derived
from experiments using different luciferases as starting (parent) sequences. The
subtitles give the starting luciferase and the temperature at which the
parameters were measured, e.g., "39-5B10 at 51°C" and so forth. All parameters
in each experiment are recorded as values relative to the respective starting
sequence, i.e., the parameter values for the starting sequence in any experiment
equal "1." (See Example 2 herein for definitions.)
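As a minimal sketch of this convention, the snippet below divides each clone's raw parameter values by those of the starting (parent) clone, so that the parent equals 1 for every parameter. The clone labels and numbers are hypothetical placeholders, not values from Table 1, and `relative_to_parent` is an illustrative name.

```python
def relative_to_parent(raw, parent):
    """Express every clone's parameters relative to the parent clone.

    raw: dict mapping clone name -> dict of parameter name -> raw value
    parent: name of the starting (parent) clone within `raw`
    """
    base = raw[parent]
    return {
        clone: {name: value / base[name] for name, value in params.items()}
        for clone, params in raw.items()
    }

# Hypothetical raw measurements (arbitrary units), for illustration only.
raw_values = {
    "parent":   {"Li": 200.0, "Tau": 4.0, "Km": 0.5, "S": 0.8},
    "mutant_A": {"Li": 300.0, "Tau": 6.0, "Km": 0.5, "S": 0.8},
}
relative = relative_to_parent(raw_values, "parent")
print(relative["mutant_A"])
```

Values from different experiments are not directly comparable under this scheme, since each experiment is scaled to its own parent.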

Thermostability has evolved in nature for various enzymes, as evidenced
by thermostable isozymes found in thermophilic bacteria. Natural evolution
works by a process of random mutagenesis (base substitutions, gene deletions,
gene insertions), followed by selection of those mutants with improved
characteristics. The process is recursive over time. Although the existence of
thermostable enzymes in nature suggests that thermostability can be achieved
through mutagenesis on an evolutionary scale, the feasibility of achieving a
given level of thermostability for a particular class of enzymes by using
short-term laboratory methods was unpredictable. The natural process of
evolution by mutation and selection, which generally involves extremely large
populations and many millions of generations and genes, cannot be used to
predict the capabilities of a modern laboratory to produce improved genes by
directed evolution until such mutants are produced.

After such success, since the overall three-dimensional structures of all
beetle luciferases are quite similar, having shown it possible for one member of
this class makes it predictable that high thermostability can be achieved for
other beetle luciferases by similar methods. FIG. 17 shows the evolutionary
relationships among beetle luciferases. All of these have a similar overall
architecture. The structural class to which the beetle luciferases belong is
determined by the secondary structure (e.g., helices are symbolized by
cylinders, sheets by collections of arrows, and loops connect helices with
sheets) (FIG. 18A). FIG. 18B shows the amino acids of the LucPpe2 luciferase,
wherein small spirals correspond to the cylinders of FIG. 18A; FIG. 18C shows
that the general beetle architecture matches (is superimposed on) that of
LucPpe2. This is support for the expectation that the methods of the present
invention may be generalized to all beetle luciferases:

Enzymes belong to different structural classes based on the
three-dimensional arrangement of secondary elements such as helices, sheets, and
loops. Thermostability is determined by how efficiently the secondary elements
are packed together into a three-dimensional structure. For each structural
class, there also exists a theoretical limit for thermostability. All beetle
luciferases belong to a common structural class, as is evident from their common
ancestry (FIG. 17), homologous amino acid sequences, and common catalytic
mechanisms.

The application of a limited number of amino acid substitutions by
mutagenesis is unlikely to significantly affect the overall three-dimensional
architecture (i.e., the structural class for mutant luciferases is not expected
to change). Because the theoretical limit for thermostability for any structural
class is not known, the potential thermostability of beetle luciferases was not
known until demonstrations of the present invention.

A priori difficulties in achieving the goals of the present invention
included:

1. The types of mutations which can be made by laboratory methods
are limited.

i) By random point mutation (e.g., by error-prone PCR), more than one
base change per codon is rare. Thus, most potential amino acid
changes are rare.

ii) Other types of random genetic changes are difficult to achieve for
areas greater than 100 bp (e.g., random gene deletions or
insertions).

2. The number of possible luciferase mutants that can be screened is
limited.

i) Based on sequence comparisons of natural luciferases, ignoring
deletions and insertions, more than 10^189 functional enzyme
sequences may be possible.

ii) If 100,000 clones could be screened per day, it would require
more than 10^179 centuries to screen all possible mutants, assuming the
same mutant was never screened twice (the actual screening rate for
the present invention was less than 5000 per day).

3. The probability of finding functional improvement requiring
cooperative mutations is low (the probability of finding a specific cooperative
pair is 1 out of 10^8 clones).
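Item 1(i) can be illustrated with the standard genetic code: from any one codon, single-base substitutions reach only a handful of the 19 alternative amino acids. The sketch below enumerates them. The codon ACG is a hypothetical threonine codon chosen for illustration (the source does not state the codon at residue 249), and the helper names are not from the source.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def single_base_neighbors(codon):
    """Amino acids (excluding the original and stops) reachable by one base change."""
    original = CODON_TABLE[codon]
    reachable = set()
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            aa = CODON_TABLE[mutant]
            if aa != original and aa != "*":
                reachable.add(aa)
    return reachable

neighbors = single_base_neighbors("ACG")  # hypothetical Thr codon
print(sorted(neighbors), len(neighbors), "of 19 possible substitutions")
```

In this example a Thr to Met change (ACG to ATG) needs one base change, whereas Thr to Trp (TGG) would need two and is therefore rarely sampled by random point mutagenesis.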

Thus, even if the theoretical limits of thermostability were known, since 
only a very small number of the possible luciferase mutants can be screened, the a 
priori probability of finding such a thermostable enzyme was low. 
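The arithmetic behind item 2 can be checked with exact integer arithmetic. The sketch below takes its figures from the text (10^189 sequences, 100,000 clones per day) and assumes 36,525 days per century; the variable names are illustrative.

```python
SEQUENCES = 10 ** 189          # possible functional sequences (from the text)
CLONES_PER_DAY = 100_000       # hypothetical screening rate (from the text)
DAYS_PER_CENTURY = 36_525      # assumption: 365.25 days per year

days_needed = SEQUENCES // CLONES_PER_DAY
centuries_needed = days_needed // DAYS_PER_CENTURY

# Order of magnitude = number of decimal digits minus one.
magnitude = len(str(centuries_needed)) - 1
print(f"about 10^{magnitude} centuries")
```

At the actual rate of fewer than 5000 clones per day, the time required would be more than twenty times longer still.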

However, the present invention now shows that it is possible and feasible to
create novel beetle luciferases having high thermostability.

a) The approximately 250 mutants produced by methods of the present
   invention wherein the initial sequence was from LucPpe2 and
   LucPpe demonstrate that it is possible and feasible for at least one
   member of this enzyme class to achieve high thermostability.
b) Any beetle luciferase should be improved by similar means since
   the luciferases belong to the same structural class.
   i) Since all beetle luciferases belong to the same structural
      class, they also share in the same pool of potentially
      stabilizing mutations (this conclusion is supported by the
      observation that a high percentage of the stabilizing
      mutations found in the clones of the present invention were
      conversions to "consensus amino acids" in other beetle
      luciferases, that is, amino acids that appear in the majority of
      beetle luciferase sequences (see FIG. 19)).
   ii) Similar results were achieved using another beetle luciferase
      from the luminous beetle Pyrophorus plagiophthalamus
      (LucPplYG). The wild-type LucPplYG has 48% sequence
      identity to the wild-type LucPpe2. Although the
      thermostability of the LucPplYG mutants was less than that of the
      LucPpe2 mutants described herein, this is because they were
      subjected to fewer cycles of directed evolution. Also, in
      some instances, mutants were selected with less emphasis
      placed on their relative thermostability. The most stable
      clone resulting from this evolution (Luc80-5E5) has a
      half-life of roughly 3.8 hours at 50°C.
To compensate for a statistical effect caused by the large number of
deleterious random mutations expected relative to the beneficial mutations,
methods were employed to maximize assay precision and to re-screen previously
selected mutations in new permutations. Among the methods for maximizing
assay precision were closely controlling culture conditions by using specialized
media, reducing growth rates, controlling heat transfer, and analyzing
parameters from mid-logarithmic phase growth of the culture; controlling mixing,
heat transfers, and evaporation of samples in the robotic screening process; and
normalizing data to spatially distributed control samples. New permutations of
the selected mutations were created by a method of DNA shuffling using
proof-reading polymerases.

The difficulty in predicting the outcome of the recursive process is
exemplified by the variable success with the other characteristics of luciferase
that were also selected for. Although the primary focus was on the enzyme
thermostability, selection for mutants with brighter luminescence, more
efficient substrate utilization, and an extended luminescence signal was also
attempted. The definitions are given by equations herewith. The selection
process was determined by changes relative to the parent clones for each
iteration of the recursive process. The amount of the change was whatever was
observed during the screening process. The expression of luciferase in E. coli
was relatively inefficient for LucPpe2, compared to Luc+. Other luciferases
varied (see FIG. 21).

To improve the overall efficiency of substrate utilization, reduction in the
composite apparent utilization constant (i.e., Km-[ATP+luciferin]) for both
luciferin and ATP was sought. Although there was an unexpected systematic
change in each utilization constant, there was little overall change. Finally,
the luminescence signal could only be moderately affected without substantially
reducing enzyme efficiency. Thus, while the enzyme thermostability was greatly
increased by methods of the present invention, other characteristics of the
enzyme were much less affected.

FIGS. 48-53 present other results of the mutant luciferases. Compositions
of the invention include luciferases having greater than the natural level of
thermostability. Each mutant luciferase is novel, because its individual
characteristics have not been reported. Specific luciferases are known by both
their protein and gene sequences. Many other luciferases were isolated that have
increased, high thermostability, but whose sequences are not known. These
luciferases were identified during the directed evolution process, and were
recognized as distinct by their enzymological characteristics.

A luciferase which is much more stable than any of the luciferase mutants
previously described is designated as mutant Luc90-1B5. New thermostable
mutants were compared to this particularly stable luciferase. The mutant
luciferases of the present invention display remarkable and heretofore
unrealized thermostability at temperatures ranging from 22°C (room temperature)
to at least as high as 65°C.

Other aspects of the invention include methods that incorporate the
thermostable luciferases, specifically beetle luciferases having high
thermostability.

Production of Luciferases of the Present Invention

The method of making luciferases with increased thermostability is
recursive mutagenesis followed by selection. Embodiments of the highly
thermostable mutant luciferases of the invention were generated by a reiterative
process of random point mutations beginning with a source nucleotide sequence,
e.g., the LucPpe2 [T249M] cDNA. Recombination mutagenesis is a part of
the mutagenesis process, along with point mutagenesis. Both recombination
mutagenesis and point mutagenesis are performed recursively. Because the
mutation process causes recombination of individual mutants in a fashion similar
to the recombination of genetic elements during sexual reproduction, the process
is sometimes referred to as the sexual polymerase chain reaction (sPCR). See,
for instance, Stemmer, U.S. Patent No. 5,605,793, issued February 25, 1997.

Taking the LucPpe2 luciferase cDNA sequence as a starting point, the
gene was mutated to yield mutant luciferases which are far more thermostable. A
single point mutation to the LucPpe2 sequence yielded the luciferase whose
sequence is depicted as T249M. Because this mutant is approximately 5 times
brighter in vivo than LucPpe2, it was utilized as a template for further
mutation. It was also used as a baseline for measuring the thermostability of
the other mutant luciferases described herein.

Embodiments Of Sequences Of Luciferases Of The Present Invention 

FIG. 45 shows the amino acid sequence of the LucPpe2 luciferase
T249M. The sequence contains a single base pair mutation changing T249 to M
(bold, underlined), which distinguishes it from the sequence reported by Leach
et al. (1997). This clone has a spectral maximum of 552 nm, which is yellow
shifted from that of the Luc of Leach. This mutant was selected for use as an
original template in some of the Examples because it is approximately 5 times
brighter in vivo than the form reported by Leach et al., which allowed for more
efficient screening by the assay. These sequences show changes from the starting
sequence (T249M) in bold face. Note that "x" in the sequence denotes an
ambiguity in the sequence.

Directed Evolution, A Recursive Process

Directed evolution is a recursive process of creating diversity through 
mutagenesis and screening for desired changes. For enzymological properties that 
result from the cumulative action of multiple amino acids, directed evolution 
provides a means to alter these properties. Each step of the process will typically 
produce small changes in enzyme function, but the cumulative effect of many 
rounds of this process can lead to substantial overall change. 

The characteristic "thermostability" is a candidate for directed evolution
because it is determined by the combined action of many of the amino acids
making up the enzyme structure. While screening for increased thermostability of
luciferase, luminescence output and efficiency of substrate binding were also
screened. This was to ensure that changes in thermostability did not also
produce undesirable changes in other important enzymological properties.

Because the frequency of deleterious mutations is much greater than that of
useful mutations, it is likely that undesirable clones are selected in each
screen within the precision limits of the present invention. To compensate for
this, the screening strategy incorporated multiple re-screens of the initially
selected mutations. However, before re-screening, the selected mutations were
"shuffled" to create a library of random intragenetic recombinations. This
process allows beneficial mutations among different clones to be recombined
together into fewer common coding sequences, and allows deleterious mutations to
be unlinked, segregated, and omitted. Thus, although essentially the same set of
selected mutations was screened again, they were screened under different
permutations as a result of the recombination or shuffling.

Although results of each step of the evolutionary process were assayed by
quantitative measurements, these measurements were made in cell lysates rather
than with purified enzymes. Furthermore, each step only measured changes in
enzyme performance relative to the prior step, so global changes in enzyme
function were difficult to judge. To evaluate the impact of directed evolution
on enzyme function, clones from the beginning, middle and end of the process
(Table 2) were purified and analyzed. The clones selected for this analysis
were Luc[T249M], 49-7C6, and 78-0B10. Another clone, 90-1B5, created by a
subsequent strategy of oligonucleotide-directed mutagenesis and screening, was
also purified for analysis.

The effect of directed evolution on thermostability was dramatic. At high
temperatures, where the parent clone was inactivated almost instantaneously, the
mutant enzymes from the related clones showed stability over several hours
(Table 1). Even at room temperature, these mutants are several fold more stable
than the parent enzyme. Subsequent analysis of 90-1B5 showed this enzyme to be
the most stable, having a half-life of 27 hours at 65°C when tested under the
same buffer conditions. With some optimization of buffer conditions, this enzyme
showed very little activity loss at 65°C over several hours (citrate buffer at
pH 6.5; FIG. 1A). This luciferase was stable at room temperature over several
weeks when incubated at pH 6.5 (FIG. 1B).

Kajiyama and Nakano (1993) showed that firefly luciferase from Luciola
lateralis was made more stable by a single amino acid substitution at position
217, from alanine (A217) to either I, L, or V. Substitution with leucine
produced a luciferase that maintained 70% of its activity after incubation for
1 hour at 50°C. All of the enzymes of the present invention created through
directed evolution are much more stable than this L. lateralis mutant. The most
stable clone, 90-1B5, maintains 75% activity after 120 hours (5 days) incubation
under similar conditions (50°C, 25 mmol/L citrate pH 6.5, 150 mmol/L NaCl,
1 mg/mL BSA, 0.1 mmol/L EDTA, 5% glycerol). Interestingly, the Luc reported by
Leach already contains isoleucine at the homologous position described for the
L. lateralis mutant.

Although thermostability was the characteristic of interest, clones were
selected based on the other enzymological parameters in the screens. By
selecting clones having greater luminescence expression, mutants were found that
yielded greater luminescence intensity in colonies of E. coli. However, the
process showed little ability to alter the kinetic profile of luminescence by
the enzymes. This failure suggests that the ability to support steady-state
luminescence is integral to the catalytic mechanism, and is not readily
influenced by a cumulative effect of many amino acids.

Substrate binding was screened by measuring an apparent composite Km
(see Example 2) for luciferin and ATP. Although the apparent composite Km
remained relatively constant, later analysis showed that the individual Km's
systematically changed. The Km for luciferin rose while the Km for ATP declined
(Table 2). The reason for this change is unknown, although it can be speculated
that more efficient release of oxyluciferin or luciferin inhibitors could lead
to more rapid enzyme turnover.

Each point mutation on its own increases (to a greater or lesser extent) the
thermostability of the mutant enzyme beyond that of the wild-type luciferase.
The cumulative effect of combining individual point mutations yields mutant
luciferases whose thermostability is greatly increased from the wild-type, often
by an order of magnitude or more.

EXAMPLES 

The following examples illustrate the methods and compositions of the 
present invention and their embodiments. 

EXAMPLE 1: Producing Thermostable Luciferases Of The Present 
Invention 

Mutagenesis Method:

An illustrative mutagenesis strategy is as follows: 
From the "best" luciferase clone, that is, a clone with improved
thermostability and not appreciably diminished values for other parameters,
random mutagenesis was performed by three variations of error-prone PCR. From
each cycle of random mutagenesis, 18 of the best clones were selected. DNA was
prepared from these clones, yielding a total of 54 clones. These clones
represent new genetic diversity.

These 54 clones were combined and recombination mutagenesis was
performed. The 18 best clones from this population were selected.

These 18 clones were combined with the 18 clones of the previous
population and recombination mutagenesis was performed. From this screening, a
new luciferase population of 18 clones was selected, representing 6 groups of
functional properties.

In this screening, the new mutations of the selected 54 clones, either in
their original sequence configurations or in recombinants thereof, were screened
a second time. Each mutation was analyzed on the average about 10 times. Of the
90 clones used in the recombination mutagenesis, it was likely that at least 10
were functionally equivalent to the best clone. Thus, the best clone or
recombinants thereof should be screened at least 100 times. Since this was
greater than the number of clones used in the recombination, there was a
significant likelihood of finding productive recombination of the best clone
with other clones.
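The clone bookkeeping in this strategy can be tallied as below. The counts come from the text; the reading that the 90 clones are the sum of the 54, 18, and 18 clones entering the recombination steps is an assumption, as is every variable name.

```python
ERROR_PRONE_PCR_VARIATIONS = 3
BEST_PER_VARIATION = 18

new_diversity = ERROR_PRONE_PCR_VARIATIONS * BEST_PER_VARIATION   # 54 clones
best_after_recombination = 18    # selected from the 54
previous_population = 18         # carried over from the prior cycle

# Assumed reading: all clones that entered recombination mutagenesis.
clones_in_recombination = (
    new_diversity + best_after_recombination + previous_population
)

screens_per_mutation = 10    # "on the average about 10 times"
equivalents_of_best = 10     # clones functionally equivalent to the best
screens_of_best = screens_per_mutation * equivalents_of_best

print(clones_in_recombination, screens_of_best)
```

The comparison the text relies on is simply that the number of screens of the best clone exceeds the number of clones entering recombination.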

Robotic Processing Methods:

Heat transfers were controlled in the robot process by using thick
aluminum at many positions where the 96-well plates were placed by the robotic
arm. For example, all shelves in the incubators or refrigerator were constructed
from 1/4 inch aluminum. One position in particular, located at room temperature,
was constructed from a block of aluminum of dimensions 4.5 x 7 x 6.5 inches.
When any 96-well plate was moved from a high temperature (e.g., incubators) or
low temperature (e.g., refrigerator) to a device at room temperature, it was
first placed on the large aluminum block for temperature equilibration. By this
means, the entire plate would rapidly reach the new temperature, thus minimizing
unequal evaporation for the various wells in the plate due to temperature
differences. Heat transfers in a stack of 96-well plates placed in an incubator
(e.g., for overnight growth of E. coli) were controlled by placing 1 mm thick
sheets of aluminum between the plates. This allowed for more efficient heat
transfer from the edges of the stack to the center. Mixing in the robotic
process was controlled by having the plate placed on a shaker for several
seconds after each reagent addition.

Please refer to FIG. 14 for a schematic of the order in which the plates are
analyzed (FIG. 15) and a robotic apparatus which can be programmed to perform
the following functions:

Culture Dilution Method. A plate (with lid) containing cells is placed on a shaker
and mixed for 3-5 minutes.

A plate (with lid) is retrieved from a carousel and placed in the reagent
dispenser. After the lid is removed and placed on the locator near the pipetter,
180 µl of media is added. The plate is then placed in the pipetter.

The plate on the shaker is placed in the pipetter, and the lid removed and
placed on the locator. Cells are transferred to the new plate using the pipetting
procedure (see "DILUTION OF CELLS INTO NEW CELL PLATE").

The lids are replaced onto both plates. The new plate is placed in the
refrigerator and the old plate is returned to the carousel.

Luminescence Assay Method. A plate containing cells is retrieved from the
carousel and placed on the shaker for 3-5 minutes to fully mix the cells, because
the cells tend to settle from solution upon standing.

To measure Optical Density (O.D.), the plate is moved from the shaker to
the locator near the luminometer; the lid is removed and the plate placed into the
luminometer. The O.D. is measured using a 620 nm filter.

When the measurement is finished, the plate is placed in the refrigerator for
storage. The above steps are completed for all plates before proceeding with
subsequent processing.

To prepare a cell lysate, the plate of cells is first retrieved from the
refrigerator and mixed on the shaker to resuspend the cells. A new plate from the
carousel without a lid is placed in the reagent dispenser and 20 µl of Buffer A is
added to each well. This plate is placed in the pipetting station.

The plate of cells in the shaker is placed in the pipetting station. A
daughter plate of cells is prepared using the pipetting procedure (see "PIPETTING
CELLS INTO THE LYSIS PLATE").

After pipetting, the new daughter plate is placed on the shaker for mixing.
The plate of cells is returned to its original position in the carousel.

After mixing, the Lysate Plate is placed into the CO2 freezer to freeze the
samples. The plate is then moved to the thaw block to thaw for 10 minutes.

The plate is then moved to the reagent dispenser to add 175 µl of Buffer B,
and then mixed on the shaker for about 15 minutes or more. The combination of
the freeze/thaw and Buffer B will cause the cells to lyse.

A new plate with a lid from the carousel is used to prepare the dilution
plate from which all assays will be derived. The plate is placed in the reagent
dispenser and the lid removed to the locator near the pipetter. 285 µl of Buffer C
is added to each well with the reagent dispenser, then the plate is placed in the
pipetting station.

The Lysate Plate in the shaker is moved to the pipetting station and the
pipetting procedure (see "DILUTION FROM LYSIS PLATE TO INCUBATION
PLATE") is used. After pipetting, the new daughter plate is placed on the shaker
for mixing. The Lysate Plate is discarded.

Two white assay plates are obtained from the plate feeder and placed in the
pipetter. The incubation plate from the shaker is placed in the pipetter, and the lid
removed and placed on the nearby locator. Two daughter plates are made using the
pipetting procedure (see "CREATE PAIR OF DAUGHTER PLATES FROM
INCUBATION PLATE"). Afterwards, the lid is replaced on the parent plate, and
the plate is placed in a high temperature incubator [ranging from 31° to about 65°C,
depending on the clone].

One daughter plate is placed in the luminometer and the 1x ASSAY
METHOD is used. After the assay, the plate is placed in the ambient incubator,
and the second daughter plate is placed in the luminometer. For the second plate,
the 0.02x ASSAY METHOD is used. This plate is discarded, and the first plate is
returned from the incubator to the luminometer. The REPEAT ASSAY method is
used (i.e., no reagent is injected). Afterwards, the plate is again returned to the
ambient incubator.

The above steps are completed for all plates before proceeding with
processing.

To begin the second set of measurements, the plate from the high
temperature incubator is placed in the shaker to mix.

The plate in the ambient incubator is returned to the luminometer and the
REPEAT ASSAY method is again used. The plate is returned afterwards to the
ambient incubator.

Two white assay plates again are obtained from the plate feeder and placed
in the pipetter. The plate on the shaker is placed in the pipetter, and the lid
removed and placed on the nearby locator. Two daughter plates are again made
using the pipetting procedure (see "CREATE PAIR OF DAUGHTER PLATES
FROM INCUBATION PLATE"). Afterwards, the lid is replaced on the parent
plate, and the plate is returned to the high temperature incubator.

One daughter plate is placed in the luminometer and the 1x ASSAY
METHOD is again used. The plate is discarded after the assay. The second
daughter plate is then placed in the luminometer and the 0.06x ASSAY METHOD
is used. This plate is also discarded.

The above steps are completed for all plates before proceeding with 
processing. 

In the final set of measurements, the plate from the high temperature
incubator is again placed in the shaker to mix.

The plate in the ambient incubator is returned to the luminometer and the
REPEAT ASSAY method is again used. The plate is discarded afterwards.

One white assay plate is retrieved from the plate feeder and placed in the
pipetter. The plate from the shaker is placed in the pipetter, and the lid removed
and placed on the nearby locator. One daughter plate is made using the pipetting
procedure (see "CREATE SINGLE DAUGHTER PLATE FROM INCUBATION
PLATE"). The lid is replaced on the parent plate and the plate is discarded.

The daughter plate is placed in the luminometer and the 1x ASSAY
METHOD is used. The plate is discarded after the assay.

Buffers:

Buffer A:
25 mM K2HPO4
0.5 mM CDTA
0.1% Triton X-100

Buffer B:
X CCLR (Promega e153a)
1.25 mg/ml lysozyme
0.04% gelatin

Buffer C:
10 mM HEPES
150 mM NaCl
1 mg/ml BSA
5% glycerol
0.1 mM EDTA

1X Assay reagent:
5 µM Luciferin
175 µM ATP
20 mM Tricine, pH 8.0
0.1 mM EDTA

0.02X Assay reagent:
1:50 dilution of 1X Assay reagent

0.06X Assay reagent:
1:150 dilution of 1X Assay reagent
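
As a worked illustration of the 1:50 dilution, the following Python sketch computes the luciferin and ATP concentrations in the 0.02X Assay reagent from the 1X recipe. The function name `dilute` and the dictionary layout are hypothetical, and the diluent is not stated in the text, so only the two substrates are tracked.

```python
# Sketch: substrate concentrations after diluting the 1X Assay reagent.
# Only luciferin and ATP are tracked; the diluent is not specified in the
# text, so the other components are deliberately left out (assumption).

ONE_X_UM = {"luciferin": 5.0, "ATP": 175.0}   # micromolar, from the 1X recipe


def dilute(reagent_um, fold):
    """Return concentrations (micromolar) after a 1:fold dilution."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    return {name: conc / fold for name, conc in reagent_um.items()}


low_substrate = dilute(ONE_X_UM, 50)   # the 0.02X Assay reagent
relative_strength = 1 / 50             # fraction of the 1X concentrations

print(low_substrate, relative_strength)
```

Under these assumptions the 0.02X reagent carries 0.1 µM luciferin and 3.5 µM ATP.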
Pipetting Procedures

PIPETTING CELLS INTO THE LYSIS PLATE
Non-aseptic procedure using fixed tips

On the pipetter deck:

- Plate containing approximately 200 µl of cells without lid
- Lysate Plate containing 20 µl of Buffer A

Procedure:

1. Move the tips to the washing station and wash with 1 ml.
2. Move to the cell plate and withdraw 60 µl.
3. Move to the Lysate Plate and dispense 45 µl.
4. Repeat steps 1-3 for all 96 samples.
5. At the conclusion of the procedure, step 1 is repeated to clean the tips.

Post-procedure:

- Place Lysate Plate onto the shaker.
- Place lid on plate with cells and place on carousel.
- Place Lysate Plate into the CO2 freezer.



DILUTION FROM LYSIS PLATE TO INCUBATION PLATE

On the pipetter deck:

- Lysate Plate containing 240 µl of lysate
- Incubation Plate without lid containing 285 µl of Buffer C

Procedure:

1. Move the tips to the washing station and wash with 0.5 ml.
2. Move to the Lysate Plate and withdraw 30 µl.
3. Move to the Incubation Plate and dispense 15 µl by direct contact with
the buffer solution.
4. Repeat steps 1-3 for all 96 samples.
5. At the conclusion of the procedure, step 1 is repeated to clean the tips.

Post-procedure:

- Place Incubation Plate on shaker.
- Discard Lysate Plate.
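
The well volumes quoted in the lysis and dilution procedures can be cross-checked with simple arithmetic. The Python sketch below is a hedged illustration only: the constant names are invented here, and the volumes are the per-well values given in the procedures (20 µl Buffer A, 45 µl cells, 175 µl Buffer B, 15 µl lysate into 285 µl Buffer C).

```python
# Volume bookkeeping for one well of the Lysate and Incubation Plates (µl).
# Constant names are illustrative; volumes are taken from the procedures.

BUFFER_A_UL = 20       # dispensed into the empty Lysate Plate
CELLS_UL = 45          # cells dispensed into the Lysate Plate
BUFFER_B_UL = 175      # added after the freeze/thaw step

lysate_total_ul = BUFFER_A_UL + CELLS_UL + BUFFER_B_UL

LYSATE_TRANSFER_UL = 15    # lysate dispensed into the Incubation Plate
BUFFER_C_UL = 285          # Buffer C already present in each well

incubation_total_ul = LYSATE_TRANSFER_UL + BUFFER_C_UL
dilution_factor = incubation_total_ul / LYSATE_TRANSFER_UL

print(lysate_total_ul, incubation_total_ul, dilution_factor)
```

The total of 240 µl agrees with the lysate volume listed on the pipetter deck, and the transfer corresponds to a 20-fold dilution of the lysate.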

CREATE PAIR OF DAUGHTER PLATES FROM INCUBATION PLATE

This procedure is done twice.

On the pipetter deck:

- Incubation Plate containing 100-300 µl of solution without lid
- Two empty Assay Plates (white)

Procedure:

1. Move the tips to the washing station and wash with 0.5 ml.
2. Move to the Incubation Plate and withdraw 50 µl.
3. Move to the first Assay Plate and dispense 20 µl.
4. Move to the second Assay Plate and dispense 20 µl.
5. Repeat steps 1-4 for all 96 samples.
6. At the conclusion of the procedure, step 1 is repeated to clean the tips.

Post-procedure:

1. Replace lid on Incubation Plate.
2. Place Incubation Plate in incubator.
3. Place first Assay Plate in luminometer.
4. Place second Assay Plate on carousel.



CREATE SINGLE DAUGHTER PLATE FROM INCUBATION PLATE

On the pipetter deck:

- Incubation Plate containing 100-300 µl of solution without lid
- Empty Assay Plate (white)

Procedure:

1. Move the tips to the washing station and wash with 0.5 ml.
2. Move to the Incubation Plate and withdraw 40 µl.
3. Move to the Assay Plate and dispense 20 µl.
4. Repeat steps 1-3 for all 96 samples.
5. At the conclusion of the procedure, step 1 is repeated to clean the tips.

Post-procedure:

- Discard Incubation Plate and its lid.
- Place Assay Plate in luminometer.



DILUTION OF CELLS INTO NEW CELL PLATE
Aseptic procedure using fixed tips

On the pipetter deck:

- Plate containing approximately 200 µl of cells without lid
- New cell plate containing 180 µl of Growth Medium without lid

Procedure:

1. Move to the cell plate and withdraw 45 µl.
2. Move to the new cell plate and dispense 20 µl by direct liquid-to-liquid
transfer.
3. Move to the waste reservoir and expel excess cells.
4. Move to the isopropanol wash station and aspirate isopropanol to sterilize tips.
5. Move to the wash station, expel isopropanol and wash tips.
6. Repeat steps 1-5 for all 96 samples.

Post-procedure:

1. Replace lid on original plate of cells and place onto carousel.
2. Replace lid on new cell plate and place into refrigerator.

Notes:

This procedure is used to prepare the cell plates used in the main analysis
procedure.

180 µl of Growth Medium is added by the reagent dispenser to each of the
new cell plates just prior to initiating the pipetting procedure.

The dispenser is flushed with 75% isopropanol before priming with
medium.

The medium also contains selective antibiotics to reduce potential
contamination.
Luminometer Procedures

1x ASSAY METHOD

- Place plate into luminometer.

1. Inject 100 µl of 1X Assay reagent.
2. Measure luminescence for 1 to 3 seconds.
3. Repeat for next well.

- Continue until all wells are measured.

0.02x ASSAY METHOD

- Place plate into luminometer.

1. Inject 100 µl of 0.02X Assay reagent.
2. Measure luminescence for 1 to 3 seconds.
3. Repeat for next well.

- Continue until all wells are measured.

0.06x ASSAY METHOD

- Place plate into luminometer.

1. Inject 100 µl of 0.06X Assay reagent.
2. Measure luminescence for 1 to 3 seconds.
3. Repeat for next well.

- Continue until all wells are measured.

REPEAT ASSAY

- Place plate into luminometer.

1. Measure luminescence for 1 to 3 seconds.
2. Repeat for next well.

- Continue until all wells are measured.



IN VIVO SELECTION METHOD

5-7 nitrocellulose disks, 200-500 colonies per disk (1000-3500 colonies
total), are screened per 2 microplates (176 clones). The clones are screened at
high temperatures using standard screening conditions.

8 positions in each microplate are reserved for a reference clone using the
"best" luciferase (the parent clone for random mutagenesis and codon
mutagenesis). The positions of the reserved wells are shown as "X" below.

XooooooooooX
oooooooooooo
oooXooooXooo
oooooooooooo
oooooooooooo
oooXooooXooo
oooooooooooo
XooooooooooX

The reference clones are made by placing colonies from DNA transformed
from the parent clone into the reference wells. (To identify these wells prior to
inoculation of the microplate, the wells are marked with a black marking pen on
the bottom of each well.)
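
The reserved-well layout can be turned into well identifiers with a few lines of Python. This is a minimal sketch, assuming the conventional 96-well labeling (rows A-H, columns 01-12) that matches sample identifiers such as 7:B06 used elsewhere in this document. The names `LAYOUT` and `reference_wells` are illustrative.

```python
# Sketch: list the reference wells ("X") in the 8 x 12 microplate layout.
# Row letters A-H and two-digit column numbers are an assumed convention.

LAYOUT = [
    "XooooooooooX",
    "oooooooooooo",
    "oooXooooXooo",
    "oooooooooooo",
    "oooooooooooo",
    "oooXooooXooo",
    "oooooooooooo",
    "XooooooooooX",
]


def reference_wells(layout):
    """Return well IDs (e.g. 'C04') for every position marked 'X'."""
    rows = "ABCDEFGH"
    return [
        f"{rows[r]}{c + 1:02d}"
        for r, line in enumerate(layout)
        for c, mark in enumerate(line)
        if mark == "X"
    ]


wells = reference_wells(LAYOUT)
sample_wells_per_plate = sum(line.count("o") for line in LAYOUT)

print(wells)                       # reference well IDs
print(2 * sample_wells_per_plate)  # clones screened per 2 microplates
```

With 8 reference wells per plate, 88 wells remain for samples, which gives the 176 clones per 2 microplates stated above.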

SCREENING SELECTION CRITERIA

The following criteria were used to screen. Criterion 1 is applied manually; data
for criteria 2-6 are generated by robotic analysis. For all criteria, the maximum
values as described are selected.

1. In vivo screen. The brightest clones are selected at an elevated
temperature.

2. Expression/specific activity. The values of normalized luminescence
are calculated as the ratio of luminescence to optical density. The
values are reported as the ratio with the reference value.

3. Enzyme stability. Measurements of normalized luminescence of the
incubated samples (3 taken over about 15 hours) are fitted to
ln(L) = ln(L0) - (t/τ), where L is normalized luminescence and t is time.
τ is a measure of the enzyme stability. The value is reported as the
ratio with the reference value, and the correlation coefficients are
calculated.

4. Substrate binding. Measurements of normalized luminescence with
1x and 0.02x are taken at the initial reading set, and 1x and 0.06x are
taken at the 5 hour set. The ratios 0.02x:1x and 0.06x:1x give
the relative luminescence at 0.02x and 0.06x concentrations. These
values, along with the relative luminescence at 1x (i.e., 1), are fitted
to a Lineweaver-Burk plot to yield the Km:app,total for the substrates
ATP, luciferin, and CoA. The value is reported as the inverse ratio
with the reference value, and the correlation coefficients are
calculated.

5. Signal stability. The luminescence of the initial 1x luminescent
reaction is re-measured 3 additional times over about 15 hours.
These values are fitted to ln(L) = ln(L0) - (t/τ) and the integral over t
(15 hours) is calculated. Signal stability is then calculated as
S = (1 - int(L)/(L0 t))^2. The value is reported as the inverse ratio with the
reference value, and the correlation coefficient is calculated.

6. Composite fitness. The values of criteria 2 through 5 are combined
into a single composite value of fitness (or commercial utility). This
value is based on a judgment of the relative importance of the other
criteria. This judgment is given below:

Criteria                Relative Value
Stability               5
Signal Stability        2
Substrate Binding       2
Expression/Activity     1

The composite, C = Sum(criteria 2-5 weighted by relative value); e.g., more
weight is on stability because that was a major goal.
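
Criteria 3 and 5 both rest on a straight-line fit of ln(L) against time. The Python sketch below shows one way such a fit and the resulting signal stability could be computed. It is an illustration under stated assumptions, not the software actually used: the function names are hypothetical, the regression is ordinary least squares, and the data in the usage lines are synthetic.

```python
import math

# Sketch of criteria 3 and 5: fit ln(L) = ln(L0) - t/tau by least squares,
# then compute S = (1 - int(L)/(L_initial * t_final))^2.
# Function names and the synthetic data below are illustrative assumptions.


def fit_decay(times_h, luminescence):
    """Return (L0, tau) from a least-squares line through (t, ln L)."""
    ys = [math.log(v) for v in luminescence]
    n = len(times_h)
    t_mean = sum(times_h) / n
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in times_h)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times_h, ys))
    slope = sxy / sxx                    # equals -1/tau
    intercept = y_mean - slope * t_mean  # equals ln(L0)
    # A zero slope (no decay at all) would need special handling here.
    return math.exp(intercept), -1.0 / slope


def signal_stability(l0, tau, l_initial, t_final):
    """S from the integral of the fitted decay over 0..t_final."""
    integral = tau * l0 * (1.0 - math.exp(-t_final / tau))
    return (1.0 - integral / (l_initial * t_final)) ** 2


# Synthetic example: L decays from 100 with tau = 10 hours.
times = [0.0, 5.0, 10.0, 15.0]
lum = [100.0 * math.exp(-t / 10.0) for t in times]
l0_fit, tau_fit = fit_decay(times, lum)
s_value = signal_stability(l0_fit, tau_fit, lum[0], times[-1])
```

A perfectly stable signal gives S = 0, so smaller S is better, which is why signal stability is reported as an inverse ratio with the reference.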

EXAMPLE 2: Software

Procedure: Organize data into SQL database. Each file created by a
luminometer (96 well) (Anthos, Austria) represents the data from one microplate.
These files are stored in the computer controlling the luminometer, and connected
to the database computer by a network link. From each microplate of samples,
nine microplates are read by the luminometer (the original microplate for optical
density and eight daughter microplates for luminescence).

Ninety files are created in total, each containing data sets for 96 samples. Each
data set contains the sample number, time of each measurement relative to the first
measurement of the plate, luminometer reading, and background corrected
luminometer reading. Other file header information is also given. The time that
each microplate is read is also needed for analysis. This can be obtained from
the robot log or the file creation time. A naming convention for the files is used
by the robot during file creation that can be recognized by SQL (e.g.,
YYMMDDPR.DAT where YY is the year, MM is the month, DD is the day, P is
the initial plate [0-9], and R is the reading [0-8]).
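
The naming convention lends itself to a small parser. The following Python sketch is hypothetical (the names `PlateFile` and `parse_plate_filename` are not from the original software) and assumes upper-case names exactly of the form YYMMDDPR.DAT.

```python
import re
from typing import NamedTuple

# Sketch: parse luminometer file names of the form YYMMDDPR.DAT.
# PlateFile and parse_plate_filename are hypothetical names.


class PlateFile(NamedTuple):
    year: int      # two-digit year (YY)
    month: int     # MM
    day: int       # DD
    plate: int     # initial plate, 0-9
    reading: int   # reading, 0-8 (0 is taken here as the optical density read)


_NAME = re.compile(r"(\d{2})(\d{2})(\d{2})([0-9])([0-8])\.DAT")


def parse_plate_filename(name: str) -> PlateFile:
    """Split a file name into date, plate and reading fields."""
    match = _NAME.fullmatch(name)
    if match is None:
        raise ValueError(f"not a YYMMDDPR.DAT name: {name!r}")
    return PlateFile(*(int(field) for field in match.groups()))


example = parse_plate_filename("97091973.DAT")  # plate 7, reading 3
files_per_run = 10 * 9                          # 10 plates x 9 readings
```

Ten initial plates with nine readings each account for the ninety files mentioned above.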
Procedure: Data Reduction And Organization.

- Normalize luminescence data: For each measurement of luminescence in the
eight daughter plates, the normalized luminescence is calculated by dividing by
the optical density of the original plate. If any value of normalized luminescence
is less than zero, assign the value of 0.1 sL, where sL is the standard deviation for
measurements of normalized luminescence.

- Calculate relative measurement time: For each normalized luminescence
measurement, the time of the measurement is calculated relative to the first
measurement of the sample. For example, the times of all luminescence
measurements of sample B6 in plate 7 (i.e., 7:B06) are calculated relative to the
first reading of 7:B06. This time calculation will involve both the time when the
plate is read and the relative time of when the sample is read in the plate.

- Calculate enzyme stability (τ): For each sample, use linear regression to fit
ln(L1x) = ln(L0) - (t/τ) using the three luminescence measurements with 1x substrate
concentrations (Plates 1, 5, 8). Also calculate the regression coefficient.

- Calculate substrate binding (Km:app,total): Using microplates from the first set of
readings (Plates 1 and 2), calculate L0.02x,rel by dividing measurements made
with substrate concentrations of 0.02x by those of 1x. Similarly, calculate
L0.06x,rel using microplates of the second set of readings (Plates 5 and 6), by
dividing measurements made with substrate concentrations of 0.06x by those of
1x.

For each sample, use linear regression to fit
1/L = (Km:app,total/Lmax:app)(1/[S]) + (1/Lmax:app) using

L                 [S]
L0.02x,rel        0.02
L0.06x,rel        0.06
1 (L1x,rel)       1

Km:app,total is calculated as the slope/intercept. Also calculate the regression
coefficient.

- Calculate signal stability (S): For each sample, use linear regression to fit
ln(L) = ln(L0) - (t/τ) using the four luminescence measurements of the initial
microplate with 1x substrate concentrations (Plates 1, 3, 4, and 7). Also calculate
the regression coefficient. From the calculated values of τ and L0, calculate the
integral of luminescence by int(L) = τ L0 (1 - exp(-tf/τ)), where tf is the average time
of the last measurement (e.g., 15 hours). The signal stability is calculated as
S = (1 - int(L)/(Li tf))^2, where Li is the initial measurement of normalized
luminescence with 1x substrate concentration (Plate 1).

[Note: To correct for evaporation, an equation S = (1 + K - int(L)/(Li tf))^2 may be used,
where 1/K = 2(relative change of liquid volume at tf).]

- Calculate the reference value surfaces: A three dimensional coordinate system
can be defined by using the grid positions of the samples within a microplate
as the horizontal coordinates, and the calculated values for the samples (Li, τ,
Km:app,total, or S) as the vertical coordinate. This three dimensional system is
referred to as a "plate map". A smooth surface in the plate maps representing a
reference level can be determined by least squares fit of the values determined for
the 8 reference clones in each microplate. For each of the 10 initial microplates of
samples, respective reference surfaces are determined for the criteria parameters
Li, τ, Km:app,total, and S (40 surfaces total).

In the least squares fit, the vertical coordinate (i.e., the criteria parameter) is the
dependent variable, and the horizontal coordinates are the independent variables. A
first order surface (i.e., z = ax + by + c) is fitted to the values of the reference clones.
After the surface is calculated, the residuals to each reference clone are calculated.
If any of these residuals is outside of a given cutoff range, the reference surface
is recalculated with omission of the aberrant reference clone.

If a first order surface does not sufficiently represent the values of the reference
clones, a restricted second order surface is used (i.e., z = a(x^2 + ky^2) + bx + cy + d,
where k is a constant).






WO 99/14336 



PCT/US98/19494 



- Calculate the reference-normalized values: For the criteria parameter of each
sample, a reference-normalized value is determined by calculating the ratio or
inverse ratio with the respective reference value. The reference-normalized values
are Li/Lir, τ/τr, Kmr/Km:app,total, and Sr/S, where reference values are calculated
from the equations of the appropriate reference surface.

- Calculate the composite scores: For each sample, calculate
C = 5(τ/τr) + 2(Sr/S) + 2(Kmr/Km:app,total) + (Li/Lir).

- Determine subgroupings: For the criteria parameters Li, τ, Km:app,total, S, and C,
delimiting values (i.e., bin sizes) for subgroupings are defined as gL, gτ, gKm,
gS, and gC. Starting with the highest values for Li, τ, or C, or the lowest values of
Km:app,total or S, the samples are assigned to bins for each criteria parameter (the
first bin being #1, and so on).

- Display sorted table of reference-normalized values: Present a table of data
for each sample showing in each row the following data:

- sample identification number (e.g., 7:B06)
- composite score (C)
- reference-normalized enzyme stability (τ/τr)
- correlation coefficient for enzyme stability
- bin number for enzyme stability
- reference-normalized signal stability (Sr/S)
- correlation coefficient for signal stability
- bin number for signal stability
- reference-normalized substrate binding (Kmr/Km:app,total)
- correlation coefficient for substrate binding
- bin number for substrate binding
- reference-normalized expression/specific activity (Li/Lir)
- bin number for expression/specific activity

The table is sorted by the composite score (C).
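
The composite score is a weighted sum of the four reference-normalized values, and a short Python sketch makes the weighting concrete. The function name, argument names, and the dictionary of reference values are illustrative assumptions. In the procedure above, the reference values for a sample would come from the fitted reference surfaces at that sample's well position.

```python
# Sketch of C = 5(tau/tau_r) + 2(S_r/S) + 2(Km_r/Km) + (L_i/L_ir).
# Names are illustrative; `ref` holds the reference values for the well.


def composite_score(tau, s, km, lum, ref):
    """Weighted sum of reference-normalized criteria for one sample."""
    return (
        5.0 * (tau / ref["tau"])     # enzyme stability, higher is better
        + 2.0 * (ref["S"] / s)       # signal stability, lower S is better
        + 2.0 * (ref["Km"] / km)     # substrate binding, lower Km is better
        + 1.0 * (lum / ref["L"])     # expression/specific activity
    )


reference = {"tau": 10.0, "S": 0.25, "Km": 0.05, "L": 1000.0}

# A sample identical to the reference scores 5 + 2 + 2 + 1 = 10.
same_as_reference = composite_score(10.0, 0.25, 0.05, 1000.0, reference)

# Doubling only the enzyme stability raises the score by 5.
twice_as_stable = composite_score(20.0, 0.25, 0.05, 1000.0, reference)
```

Because the ratios are taken against the reference, a score above 10 indicates a sample that is better overall than the reference clone under this weighting.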






Procedure: Present sorted table of criteria parameters.

Present a table of data for each sample showing in each row the following data:

- sample identification number
- composite score (C)
- enzyme stability (τ)
- correlation coefficient for enzyme stability
- bin number for enzyme stability
- signal stability (S)
- correlation coefficient for signal stability
- bin number for signal stability
- substrate binding (Km:app,total)
- correlation coefficient for substrate binding
- bin number for substrate binding
- expression/specific activity (Li)
- bin number for expression/specific activity

The table is sorted by the composite score (C); the reference clones are
excluded from the table. Same entry coding by standard deviation as described
above.

Procedure: Present sorted table of reference-normalized values.

This is the same procedure as the final step of the data reduction procedure. The
table will show:

- sample identification number
- composite score (C)
- reference-normalized enzyme stability (τ/τr)
- correlation coefficient for enzyme stability
- bin number for enzyme stability
- reference-normalized signal stability (Sr/S)
- correlation coefficient for signal stability
- bin number for signal stability
- reference-normalized substrate binding (Kmr/Km:app,total)
- correlation coefficient for substrate binding
- bin number for substrate binding
- reference-normalized expression/specific activity (Li/Lir)
- bin number for expression/specific activity

The table is sorted by the composite score (C); the reference clones are
excluded from the table. Same entry coding by standard deviation as described
above.

Procedure: Present sorted table of criteria parameters for reference clones.

This is the same procedure as described above for criteria parameters, except for
only the reference clones. The table will show:

- sample identification number
- composite score (C)
- enzyme stability (τ)
- correlation coefficient for enzyme stability
- bin number for enzyme stability
- signal stability (S)
- correlation coefficient for signal stability
- bin number for signal stability
- substrate binding (Km:app,total)
- correlation coefficient for substrate binding
- bin number for substrate binding
- expression/specific activity (Li)
- bin number for expression/specific activity

The table is sorted by the composite score (C). Same entry coding by
standard deviation as described above.

Procedure: Present sorted table of reference-normalized values for reference clones.

This is the same procedure as described above for reference-normalized values,
except for only the reference clones. The table will show:

- sample identification number
- composite score (C)
- reference-normalized enzyme stability (τ/τr)
- correlation coefficient for enzyme stability
- bin number for enzyme stability
- reference-normalized signal stability (Sr/S)
- correlation coefficient for signal stability
- bin number for signal stability
- reference-normalized substrate binding (Kmr/Km:app,total)
- correlation coefficient for substrate binding
- bin number for substrate binding
- reference-normalized expression/specific activity (Li/Lir)
- bin number for expression/specific activity

The table is sorted by the composite score (C). Same entry coding by
standard deviation as described above.

Procedure: Sort table.

Any table may be sorted by any entries as primary and secondary key.

Procedure: Display histogram of table.

For any table, a histogram of criteria parameter vs. bin number may be displayed
for any criteria parameter.
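
Bin assignment and the histogram over bin numbers can be sketched in a few lines of Python. The text does not spell out the exact binning rule, so the version below is an assumption: bin #1 starts at the best value (highest or lowest, depending on the parameter), and each further bin spans one bin size g. The function name `assign_bins` is illustrative.

```python
import math
from collections import Counter

# Sketch: assign samples to bins of width `bin_size`, counting from the
# best value (bin #1). The exact binning rule is an assumption.


def assign_bins(values, bin_size, best_is_high=True):
    """Return a 1-based bin number for each value."""
    best = max(values) if best_is_high else min(values)
    return [int(math.floor(abs(best - v) / bin_size)) + 1 for v in values]


# Composite scores: higher is better, bin size gC = 0.5.
scores = [2.0, 1.75, 1.5, 1.0]
score_bins = assign_bins(scores, 0.5, best_is_high=True)

# Km values: lower is better, bin size gKm = 0.5.
km_values = [0.5, 1.0, 1.25]
km_bins = assign_bins(km_values, 0.5, best_is_high=False)

# Histogram data: number of samples per bin number.
histogram = Counter(score_bins)
```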

Procedure: Display plate map.

For any plate, a plate map may be displayed showing a choice of:

- any luminescence or optical density measurement
- Li
- Li reference surface
- τ
- τ reference surface
- τ/τr
- correlation coefficient of τ
- S
- S reference surface
- Sr/S
- correlation coefficient of S
- Km:app,total
- Km reference surface
- Kmr/Km:app,total
- correlation coefficient for Km:app,total
- composite score (C)

The plate maps are displayed as a three dimensional bar chart. Preferably, the
bars representing the reference clones are indicated by color or some other means.
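
The reference surfaces shown in these plate maps are first order least squares fits to the reference clones. The Python sketch below fits z = ax + by + c by solving the normal equations with Cramer's rule. It is a minimal illustration: the function names are hypothetical, the well coordinates (column, row) are an assumed convention, and the synthetic reference values lie exactly on a plane so that the fit can be checked.

```python
# Sketch: least-squares fit of a first order surface z = a*x + b*y + c
# to reference-clone values. Names and coordinates are illustrative.


def det3(m):
    """Determinant of a 3 x 3 matrix given as nested lists."""
    return (
        m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
        - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
        + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0])
    )


def fit_plane(points):
    """Return (a, b, c) minimizing the squared residuals of z = ax + by + c."""
    n = len(points)
    sx = sum(x for x, _, _ in points)
    sy = sum(y for _, y, _ in points)
    sz = sum(z for _, _, z in points)
    sxx = sum(x * x for x, _, _ in points)
    syy = sum(y * y for _, y, _ in points)
    sxy = sum(x * y for x, y, _ in points)
    sxz = sum(x * z for x, _, z in points)
    syz = sum(y * z for _, y, z in points)
    matrix = [[sxx, sxy, sx], [sxy, syy, sy], [sx, sy, n]]
    rhs = [sxz, syz, sz]
    d = det3(matrix)
    coeffs = []
    for col in range(3):  # Cramer's rule, one column at a time
        replaced = [row[:] for row in matrix]
        for r in range(3):
            replaced[r][col] = rhs[r]
        coeffs.append(det3(replaced) / d)
    return tuple(coeffs)


def residuals(points, coeffs):
    """Observed minus fitted value for each reference clone."""
    a, b, c = coeffs
    return [z - (a * x + b * y + c) for x, y, z in points]


# (column, row) positions of the 8 reference wells, with synthetic values.
positions = [(1, 1), (12, 1), (4, 3), (9, 3), (4, 6), (9, 6), (1, 8), (12, 8)]
ref_points = [(x, y, 0.1 * x - 0.2 * y + 5.0) for x, y in positions]
plane = fit_plane(ref_points)
worst = max(abs(r) for r in residuals(ref_points, plane))
```

In a fuller version, a reference clone whose residual exceeds the cutoff would be dropped and `fit_plane` called again on the remaining points.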

Procedure: Display drill-down summary of each entry.

For Li, τ, Km:app,total, and S, any entry value in a table may be selected to display
the luminescence and optical density readings underlying the value calculation, and
a graphical representation of the curve fit where appropriate. Preferably the
equations involved and the final result and correlation coefficient will also be
displayed.

- Li or Li/Lir. Display the optical density and luminescence value from the
chosen sample in Plate 0 and Plate 1.

- τ or τ/τr. Display the optical density and luminescence value from the
chosen sample in Plate 0, Plate 1, Plate 5, and Plate 8. Display graph of ln(L1x)
vs. t, showing data points and best line.

- S or Sr/S. Display the optical density and luminescence value from the
chosen sample in Plate 0, Plate 1, Plate 3, Plate 4, and Plate 7. Display graph of
ln(L) vs. t, showing data points and best line.

- Km:app,total or Kmr/Km:app,total. Display the optical density and luminescence
value from the chosen sample in Plate 0, Plate 1, Plate 2, Plate 5, and Plate 6.
Display graph of 1/L vs. 1/[S], showing data points and best line.
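
The 1/L vs. 1/[S] graph corresponds to a Lineweaver-Burk fit, from which the apparent Km is the slope divided by the intercept. The Python sketch below illustrates this with ordinary least squares on the three relative substrate concentrations. The function names are hypothetical and the test data are synthetic, generated from a Michaelis-Menten curve with a chosen Km of 0.05 (in units of the 1x concentration).

```python
# Sketch: Lineweaver-Burk estimate of the apparent Km from relative
# luminescence at 0.02x, 0.06x and 1x substrate. Names are illustrative.


def linear_fit(xs, ys):
    """Ordinary least squares; returns (slope, intercept)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_mean - slope * x_mean


def apparent_km(rel_lum_by_conc):
    """Fit 1/L = (Km/Lmax)(1/[S]) + 1/Lmax and return slope/intercept."""
    concs = sorted(rel_lum_by_conc)
    inv_s = [1.0 / s for s in concs]
    inv_l = [1.0 / rel_lum_by_conc[s] for s in concs]
    slope, intercept = linear_fit(inv_s, inv_l)
    return slope / intercept


# Synthetic data: Michaelis-Menten response with Km = 0.05, expressed
# relative to the 1x reading so that the 1x value is exactly 1.
true_km = 0.05
data = {s: (s / (true_km + s)) * (true_km + 1.0) for s in (0.02, 0.06, 1.0)}
km_estimate = apparent_km(data)
```

Because Km is obtained as a ratio of slope to intercept, the unknown Lmax cancels, which is why relative luminescence values are sufficient for the fit.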
EXAMPLE 3: Preparation Of Novel Luciferases

The gene of FIG. 1 contains a single base pair mutation at position 249,
T to M. This clone has a spectral maximum of 552 nm, which is yellow shifted
from the sequence of Luc. This mutant was selected as an original template
because it is about 5 times brighter in vivo, which allowed for more efficient
screening.

C-terminus mutagenesis

To eliminate the peroxisome targeting signal (-SKL), the L was mutated to
a STOP and the 3 codons immediately upstream were randomized according to the
oligonucleotide mutagenesis procedure described herein. The mutagenic
oligonucleotide designed to accomplish this also introduces a unique SpeI site to
allow mutant identification without sequencing. The mutants were screened in
vivo and 13 colonies picked, 12 of which contained the SpeI site.

N-terminus mutagenesis

To test if expression could be improved, the 3 codons immediately
downstream from the initiation Met were randomized as described herein. The
mutagenic oligo designed to accomplish this also introduces a unique ApaI site to
allow mutant identification without sequencing. Seven clones were selected, and
six of the isolated plasmids were confirmed to be mutants.

Shuffling of C- and N-terminus mutants

The C- and N-terminus mutagenesis was performed side-by-side. To
combine the N- and C-terminus mutations, selected clones from each mutagenesis
experiment were combined with the use of recombination mutagenesis according
to the recombination mutagenesis protocol described herein. The shuffled mutants
were subcloned into an ampicillin-sensitive pRAM backbone and screened in DH5 FIQ
(BRL; Hanahan, 1985). A total of 24 clones were picked; only 4 contained both the
N- and C-terminus mutations. These 4 clones were used as templates for
randomization of the cysteine positions in the gene.

Mutagenesis to randomize cysteine positions/Random mutagenesis and
recombination mutagenesis in the Luc gene

There are 7 cysteine positions in the Ppe-2 gene. It is known that these
positions are susceptible to oxidation, which could cause destabilization of the
protein. Seven oligonucleotides were ordered to randomize the cysteine positions.

The oligonucleotides were organized into two groups based upon the
conservation of cysteine in other luciferase genes from different families. Group 1
randomizes the conserved cysteine positions C-60, C-80, and C-162. Group 2
randomizes cysteines that are not strictly conserved at positions C-38, C-127,
C-221, and C-257.

The four selected templates from the N- and C-terminus mutagenesis were
sub-cloned into an ampicillin-sensitive backbone and single-stranded DNA was
prepared for each of the templates. These templates were combined in equal
amounts and oligonucleotide mutagenesis was completed as described herein. It
was determined by plating an aliquot of the mutS transformation prior to overnight
incubation that each of the 2 groups contained 2x10^4 independent transformants.
MutS-DNA was prepared for the 2 groups and was then transformed into JM109
cells for screening. Mutants from group 1 were screened in vivo and picks were
made for a full robotic run. Five clones were selected that had improved
characteristics. Mutants from group 2 were screened in vivo and picks were made
for a full robotic run. The temperature incubator on the robot was set at 33°C for
this set of experiments. Ten clones were selected that had improved
characteristics.

The fifteen best picks from both groups of the cysteine mutagenesis
experiments were shuffled together as described herein and 18 of the best clones
were selected after robotic processing.

The "best" clone from the above experiment (31-1G8) was selected as a
template for subsequent rounds of mutagenesis. (The high temperature robot
incubator temperature was set to 42°C.) Another complete round of mutagenesis
was completed.

The 18 best clones from the above mutagenesis were picked and clone
39-5B10 was selected as the best clone and was used as a template for another round
of mutagenesis. (The high temperature robot incubator temperature was set at
49°C.)

After this cycle, 6 of the best clones were selected for sequencing. Based
upon the sequence data, nine positions were selected for randomization and seven
oligos were designed to cover these positions. Based upon data generated from
the robot, it was determined that the best clone from the group of six clones that
were sequenced was clone 49-7C6. The luciferase gene from this clone was
sub-cloned into an ampicillin-sensitive pRAM backbone and single-stranded DNA was
prepared. The randomization of the selected positions was completed according to
the oligonucleotide mutagenesis procedure listed above.

The randomization oligos were divided into 4 groups, and transformants
from these experiments were picked and two robotic runs were completed. Ten
clones were selected from the two experiments. (The high temperature robot
incubator temperature was set at 56°C.)

The best 10 picks from the above two experiments, and the best 18 picks
from the previous population of clones, were shuffled together (recombination
mutagenesis protocol).

The 18 best clones were selected and clone 58-0A5 was determined to be
the best clone. This clone was then used as a template for another round of
mutagenesis. The high temperature robot incubator temperature was set at 56°C.
Clone 71-504 was selected as a new lead clone and another round of mutagenesis
was completed, with the incubator set at 60°C.

The best 18 picks were selected and the best clone from this group was
determined to be clone 78-0B10. The temperature stability of clones at various
temperatures is presented in the FIGS.

EXAMPLE 4: Mutagenesis Strategy From Clone 78-0B10 to 90-1B5

1. 23 oligos (oligonucleotides) were ordered to change 28 positions to consensus.
All of the oligos were tested individually using oligo-directed mutagenesis
with single-stranded DNA from clone luc78-0B10 as a template to determine
which oligos gave an improvement in stability. Below is a table which lists the
mutagenic oligos.

Description                          Oligo synthesis number

A17 to T                             6215
M25 to L                             6216
S36 to P; remove restriction site    6217
A101 to V, S105 to N                 6218
I125 to V                            6219
K139 to Q                            6220
V145 to I                            6221
V194 to I                            6222
V203 to L, S204 to P                 6231
A216 to V                            6232
A229 to Q                            6233
M249 to T (reversion)*               6234
T266 to R, K270 to E                 6235
E301 to D                            6236
N333 to P, F334 to G                 6237
R356 to K                            6238
I363 to V                            6246
A393 to P                            6247
R417 to H                            6248
G482 to V                            6249
N492 to T                            6250
F499 to Y, S501 to A                 6251
L517 to V                            6252
F537 to L                            6253

*Note that oligo #6234 does not change a consensus position. This oligo causes a
reversion of position 249 to the wild-type Ppe-2 codon. Although reversion of
this position was shown to increase thermostability, reversion of this position
decreased light output.

2. Oligonucleotide-directed mutagenesis with clone luc78-0B10 as a template:

Based on the results of individually testing the mutagenic oligonucleotides
listed above, three experiments were completed and oligos for these
experiments were divided in the following manner:

a. 6215, 6234, 6236, 6248 (found to give increased stability)

b. 6215, 6217, 6218, 6219, 6220, 6221, 6222, 6231, 6233, 6234, 6236, 6238,
6247, 6248, 6249, 6251, 6253 (found to be neutral or to have increased stability)

c. All 23 oligos.

3. Selections from the three experiments listed above were screened with the
robotic screening procedure (Experiment 84). (luc78-0B10 was used as a control.)

4. Selections from Experiment 84 were recombined using the recombination
mutagenesis procedure and then screened with the robotic screening procedure
(Experiment 85).

5. Single-stranded DNA was prepared from three (3) clones: luc85-3E12,
luc85-4F12, and luc85-5A4. These clones were used as templates for
oligonucleotide-directed mutagenesis to improve codon usage. Positions were
selected based upon a codon usage table published in Nucleic Acids Research,
vol. 18 (supplement), 1990, page 2402. The table below lists oligos that were
used to improve codon usage in E. coli.



Description                                     Oligo synthesis #

L7-(tta-ctg), remove Apa I site                 6258
L29-(tta-ctg)                                   6259
T42-(aca-acc)                                   6260
L51, L56-(tta-ctg), L58-(ttg-ctg)               6261
L71-(tta-ctg)                                   6262
L85-(ttg-ctg)                                   6263
L95-(ttg-ctg), L97-(ctt-ctg)                    6273
L113, L117-(tta-ctg)                            6274
L151, L153-(tta-ctg)                            6275
L163-(ctc-ctg)                                  6276
R187-(cga-cgt)                                  6277
L237-(tta-ctg)                                  6279
R260-(cga-cgc)                                  6280
L285, L290-(tta-ctg), L286-(ctt-ctg)            6281
L308-(tta-ctg)                                  6282
L318-(tta-ctg)                                  6283
L341-(tta-ctg), T342-(aca-acc)                  6284
L380-(ttg-ctg)                                  6285
L439-(tta-ctg)                                  6286
L456-(ctc-ctg), L457-(tta-ctg)                  6293
T506-(aca-acc), L510-(cta-ctg)                  6305
R530-(aga-cgt)                                  6306
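
The codon changes in the table above are meant to alter codon usage without altering the encoded protein. As an illustrative check only (not part of the disclosed procedure), the following Python sketch confirms that each codon swap listed in the table is synonymous under the standard genetic code. The names CODON_TO_AA, SWAPS, is_synonymous and non_synonymous are hypothetical helpers introduced here.

```python
# Standard genetic code, restricted to the codons that occur in the table above.
CODON_TO_AA = {
    "tta": "L", "ttg": "L", "ctt": "L", "ctc": "L", "cta": "L", "ctg": "L",
    "aca": "T", "acc": "T",
    "cga": "R", "cgt": "R", "cgc": "R", "aga": "R",
}

# (original codon, replacement codon) pairs as they appear in the table.
SWAPS = [
    ("tta", "ctg"), ("ttg", "ctg"), ("ctt", "ctg"), ("ctc", "ctg"),
    ("cta", "ctg"), ("aca", "acc"), ("cga", "cgt"), ("cga", "cgc"),
    ("aga", "cgt"),
]


def is_synonymous(old, new):
    """Return True if both codons encode the same amino acid."""
    return CODON_TO_AA[old.lower()] == CODON_TO_AA[new.lower()]


def non_synonymous(swaps):
    """Return the swaps that would change the encoded amino acid."""
    return [(old, new) for old, new in swaps if not is_synonymous(old, new)]


if __name__ == "__main__":
    print("non-synonymous swaps:", non_synonymous(SWAPS))
```

An empty list from non_synonymous indicates that every listed swap preserves the amino acid, which is what a codon usage change is expected to do.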



6. In the first experiment, the three templates listed above from Experiment 85
were combined and used as templates for oligonucleotide-directed
mutagenesis. All of the oligos were combined in one experiment and clones
resulting from oligonucleotide-directed mutagenesis were screened using the
robotic screening procedure as Experiment 88. A low percentage of
luminescent colonies resulted from this experiment, so another
oligonucleotide-directed mutagenesis experiment was completed in which the
oligonucleotides were combined in the following groups:

a. 6258, 6273, 6280, 6286

b. 6259, 6274, 6281, 6293

c. 6260, 6275, 6282, 6294

d. 6261, 6276, 6283, 6305

e. 6262, 6277, 6284, 6306

f. 6263, 6279, 6285

7. It was discovered that samples from group b had a low number of luminescent
colonies, and it was hypothesized that one of the oligos in group b was
causing problems. Selections were made from all of the experiments with the
exception of group b. Samples were then run through the robotic
screening procedure (Experiment 89).

8. Selections from Experiments 88 and 89 were shuffled together with the
recombination mutagenesis protocol and were then screened with the robotic
screening procedure (Experiment 90).

MATERIALS AND METHODS

A. Mutagenesis Protocol 

The mutant luciferases disclosed herein were produced via random
mutagenesis with subsequent in vivo screening of the mutated genes for a plurality
of characteristics including light output and thermostability of the encoded
luciferase gene product. The mutagenesis was achieved by generally following a
three-step method:

1. Creating genetic diversity through random mutagenesis. Here, error-prone
PCR of a starting sequence such as that of Luc was used to create
point mutations in the nucleotide sequence. Because error-prone PCR
yields almost exclusively single point mutations in a DNA sequence, a
theoretical maximum of 7 amino acid changes is possible per nucleotide
mutation. In practice, however, approximately 6.1 amino acid changes per
nucleotide are achievable. For the 550 amino acids in luciferase,
approximately 3300 mutants are possible through point mutagenesis.

2. Consolidating single point mutations through recombination
mutagenesis. The genetic diversity created by the initial mutagenesis is
recombined into a smaller number of clones by sPCR. This process
reduces the number of mutant clones. In addition, because the rate of
mutagenesis is high, the probability of linkage to negative mutations is
significant. Recombination mutagenesis unlinks positive mutations from
negative mutations. The mutations are "re-linked" into new genes by
recombination mutagenesis to yield the new permutations. Then, after
re-screening the recombination mutants, the genetic permutations that have
the "negative mutations" are eliminated by not being selected.
Recombination mutagenesis also serves as a secondary screen of the initial
mutants prepared by error-prone PCR.

3. Broadening genetic diversity through random mutagenesis of selected
codons. Because random point mutagenesis can only achieve a limited
number of amino acid substitutions, complete randomization of selected
codons is achieved by oligonucleotide mutagenesis. The codons to be
mutated are selected from the results of the preceding mutagenesis
processes on the assumption that for any given beneficial substitution, other
alternative amino acid substitutions at the same positions may produce
even greater benefits. The positions to be mutated are identified by DNA
sequencing of selected clones.
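
The limit on substitutions reachable by point mutagenesis (step 1 above) can be explored with a short calculation. The Python sketch below is illustrative only: it enumerates, for each sense codon of the standard genetic code, the distinct amino acids reachable by a single nucleotide change, and it reproduces the multiplication of 550 residues by roughly 6.1 changes per position. The function names are hypothetical, and the per-codon figures it prints weight all codons equally, so they need not match the 7 and 6.1 quoted above, which may rest on different counting assumptions.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
GENETIC_CODE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def reachable_amino_acids(codon):
    """Amino acids (excluding stops and the original) one base change away."""
    original = GENETIC_CODE[codon]
    reachable = set()
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            aa = GENETIC_CODE[mutant]
            if aa != "*" and aa != original:
                reachable.add(aa)
    return reachable


def summarize():
    """Return (maximum, unweighted mean) of reachable counts over sense codons."""
    counts = [
        len(reachable_amino_acids(codon))
        for codon, aa in GENETIC_CODE.items()
        if aa != "*"
    ]
    return max(counts), sum(counts) / len(counts)


def single_step_variants(n_residues, changes_per_position):
    """Approximate number of single-substitution variants of a protein."""
    return n_residues * changes_per_position


if __name__ == "__main__":
    print("max, mean reachable per codon:", summarize())
    print("variants for 550 residues at 6.1 each:", single_step_variants(550, 6.1))
```

Because at most nine single-base neighbors exist for any codon, most of the 19 alternative amino acids at a position are unreachable in one step, which is the motivation given for the codon randomization in step 3.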

B. Initial mutagenesis experiments 

Both the N-terminus and the C-terminus of the starting sequence were
modified by oligonucleotide-directed mutagenesis to optimize expression and
remove the peroxisomal targeting sequence. At the N-terminus, nine bases
downstream of the initiation codon were randomized; at the C-terminus, nine
bases upstream of the termination codon were randomized. Mutants were
analyzed using an in vivo screen, resulting in no significant change in expression.

Six clones from this screen were pooled and used to mutate the codons for
seven cysteines. These codons were randomized using oligonucleotide-directed
mutagenesis, and the mutants were screened using the robotic screening
procedure. From this screen, fifteen clones were selected for directed evolution.

C. Generating and Testing Clones

Several very powerful and widely known protocols are used to generate
and test the clones of the present invention. Unless noted otherwise, these
laboratory procedures are well known to one of skill in the art. Particularly noted
as being well known to the skilled practitioner are the polymerase chain reaction
(PCR) devised by Mullis and various modifications to the standard PCR protocol
(error-prone PCR, sPCR, and the like), DNA sequencing by any method (Sanger's
or Maxam and Gilbert's methodology), amino acid sequencing by any method
(e.g., the Edman degradation), and electrophoretic separation of polynucleotides
and polypeptides/proteins.

D. Vector Design 

A preferred vector (pRAM) used for the mutagenesis procedure contains
several unique features that allow for the mutagenesis strategy to work efficiently:

The pRAM vector contains a filamentous phage origin, f1, which is
necessary for the production of single-stranded DNA.

Two SfiI sites flank the gene. These sites were designed so that the
gene to be subcloned can only be inserted in the proper orientation.

The vector contains a tac promoter.

Templates to be used for oligonucleotide mutagenesis contain a 4 base-pair
deletion in the bla gene which makes the vector ampicillin-sensitive. The
oligonucleotide mutagenesis procedure uses a mutant oligonucleotide as well as an
ampicillin repair oligonucleotide that restores function to the bla gene. This
allows for the selection of a high percentage of mutants. (If selection is not used,
it is difficult to obtain a high percentage of mutants.)

E. Uses of Luciferases 

The mutant luciferases of the present invention are suitable for use in any
application for which previously known luciferases were used, including the
following:

ATP Assays. The greater enzyme stability means that reagents designed
for detection of ATP have a greater shelf-life and operational-life at higher
temperatures (e.g., room temperature). Therefore, a method of detecting ATP
using luciferases with increased thermostability is novel and useful.

Luminescent labels for nucleic acids, proteins, or other molecules.
Analogous to advantages of the luciferases of the present invention for ATP
assays, their greater shelf-life and operational-life is a benefit to the reliability and
reproducibility of luminescent labels. This is particularly advantageous for
labeling nucleic acids in hybridization procedures where hybridization
temperatures can be relatively high (e.g., greater than 40°C). Therefore, a method
of labeling nucleic acids, proteins, or other molecules using luciferases of the
present invention is novel and useful.

Genetic reporter. In the widespread application of luciferase as a genetic
reporter, where detection of the reporter is used to infer the presence of another
gene or process of interest, the increased thermal stability of the luciferases
provides less temperature dependence of its expression in living cells and in
cell-free translations and transcription/translation systems. Therefore, a method using
the luciferases of the present invention as genetic reporters is novel and useful.

Enzyme immobilization. Enzymes in close proximity to physical surfaces
can be denatured by their interaction with that surface. The high density
immobilization of luciferases onto a surface to provide strong localized
luminescence is improved by using high stability luciferases. Therefore, a method
of immobilizing luciferases onto a solid surface using luciferases of the present
invention is novel and useful.

Hybrid proteins. Hybrid proteins made by genetic fusion of genes encoding
luciferases with other genes, or through a chemical coupling process, benefit by
having a greater shelf-life and operational-life. Therefore, a method of producing
hybrid proteins through genetic means or chemical coupling using the luciferases
of the present invention is novel and useful.

High temperature reactions. The light intensity of a luciferase reaction
increases with temperature until the luciferase begins to denature. Because the use
of thermostable luciferases allows for use at greater reaction temperatures, the
luciferases of the present invention are novel and useful for performing high
temperature reactions.

Luminescent solutions. Luminescence has many general uses, including
educational, demonstrational, and entertainment purposes. These applications
benefit from having enzymes with greater shelf-life and operational-life.
Therefore, a method of making luminescent solutions using the luciferases of the
present invention is novel and useful.

F. Firefly luciferase 

The firefly luciferase gene chosen for directed evolution was Luc isolated
from Photuris pennsylvanica. The luciferase was cloned from fireflies collected in
Maryland by Wood et al. and later was independently cloned by Dr. Leach using
fireflies collected in Oklahoma (Ye et al., 1997). A mutant of this luciferase
(T249M) was made by Wood et al. and used in the present invention because it
produced approximately 5-fold more light when expressed in colonies of E. coli.

Overview of Evolution Process: Directed evolution was achieved through
a recursive process, each step consisting of multiple cycles of 1) creating
mutational libraries of firefly luciferase followed by 2) screening the libraries to
identify new mutant clones having a plurality of desired enzymological
characteristics.

To begin the process, three mutational libraries were created using
error-prone PCR (Fromant et al., 1995). Each library was screened first by visual
evaluation of luminescence in colonies of E. coli (Wood and De Luca, 1987), and
then by quantitative measurements of enzymological properties in E. coli cell
lysates. Approximately 10,000 colonies were examined in the visual screen, from
which 704 were selected for quantitative analysis. From each quantitative screen
18 clones were selected.

The three sets of 18 clones each were pooled together, and a new
mutational library was created using DNA shuffling to generate intragenic
recombinations (sPCR; Stemmer, 1994). The results were screened to yield
another set of 18 clones. The entire process was completed by combining this set
of 18 clones with 18 clones from the previous round of evolution, creating another
mutational library by DNA shuffling, and screening as before.

Screening method: In the qualitative visual screen, colonies were selected
only for their ability to sustain relatively bright luminescence. The thermal
stability of the luciferase within the colonies of E. coli was progressively
challenged in successive rounds of evolution by increasing the temperature of the
screen. The selected colonies were inoculated into wells of 96-well plates each
containing 200 µl of growth medium.

In the quantitative screens, lysates of the E. coli cultures were measured for
1) luminescence activity, 2) enzyme stability, 3) sustained enzymatic turnover, and
4) substrate binding.

"Luminescence activity" was measured as the ratio of luminescence
intensity to the optical density of the cell culture.

"Enzyme stability" was determined by the rate of activity loss from cell
lysates over 10 hours. In successive rounds of evolution the incubation
temperature of the lysates was increased.

"Sustained enzymatic turnover" was determined by the rate of
luminescence loss of a signal enzymatic reaction over 10 hours at room
temperature.

"Substrate binding" was determined by the relative activity of the
lysate when assayed with diluted substrate mixtures. Of these four parameters, the
highest priority for selection was placed on thermostability.
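
As a rough illustration of how such per-clone criteria might be tabulated, the Python sketch below computes luminescence activity as the ratio defined above and ranks clones with stability first. It is a simplified, hypothetical sketch: the LysateReadout fields, the use of "fraction of activity remaining after incubation" as a stand-in for the rate of activity loss, and the tie-breaking rule are all assumptions and are not taken from the robotic procedure itself.

```python
from dataclasses import dataclass


@dataclass
class LysateReadout:
    """Hypothetical per-clone measurements (arbitrary light units)."""
    clone: str
    luminescence: float              # initial luminescence of the lysate
    optical_density: float           # optical density of the culture
    luminescence_after_heat: float   # luminescence after incubation
    luminescence_dilute: float       # luminescence with diluted substrate


def luminescence_activity(r):
    """Ratio of luminescence intensity to culture optical density."""
    if r.optical_density <= 0:
        raise ValueError("optical density must be positive")
    return r.luminescence / r.optical_density


def fraction_remaining(r):
    """Simplified stability proxy: activity left after incubation."""
    return r.luminescence_after_heat / r.luminescence


def relative_dilute_activity(r):
    """Relative activity when assayed with diluted substrate."""
    return r.luminescence_dilute / r.luminescence


def rank_clones(readouts):
    """Sort clones with stability as the first key, activity as tie-breaker."""
    return sorted(
        readouts,
        key=lambda r: (fraction_remaining(r), luminescence_activity(r)),
        reverse=True,
    )


if __name__ == "__main__":
    data = [
        LysateReadout("clone_a", 1000.0, 0.5, 400.0, 300.0),
        LysateReadout("clone_b", 800.0, 0.4, 640.0, 200.0),
    ]
    for r in rank_clones(data):
        print(r.clone, luminescence_activity(r), fraction_remaining(r))
```

Sorting on a tuple keeps thermostability as the dominant criterion, mirroring the stated priority, while still ordering clones of equal stability by light output.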

Robotic Automation. Robotic automation was used in the quantitative
screens to accurately perform the large number of required quantitative assays on
the cultured cells. Overnight cultures were first diluted into fresh medium and
grown for 3 hours to produce cultures in mid-log phase growth. The optical
density of each culture was then measured, and aliquots of the cultures were
lysed by freeze/thaw and lysozyme. The resulting lysates were further diluted
before analysis and incubated at elevated temperatures. Luminescence was
measured from aliquots of the diluted lysates, taken at various times, and
measured under various conditions as prescribed by the analytical method (see
Example 2). Computer analysis of this data yielded the quantitative selection
criteria described above.

Summary of evolutionary progression: After mutagenesis of the N- and
C-termini, and randomization of the cysteine codons, a pool of 15 clones was
subjected to two rounds of directed evolution as described herein. Five of the 18
clones resulting from this process were sequenced to identify mutations. One of
these clones, designated 49-7C6, was chosen for more detailed analysis and
further mutagenesis. This clone contained 10 new amino acid substitutions
compared to the luciferase Luc[T249M].

To assess the potential for other amino acid replacements at the sites of
these substitutions, oligonucleotide-directed mutagenesis was used to randomize
these codons. The resulting clones were screened as described herein, and 18
selected clones were used to initiate two new rounds of directed evolution. Of the
18 clones resulting from this second set of rounds, the clone designated 78-0B10
was chosen for additional study and mutagenesis. This clone encoded a luciferase
that contained 16 new amino acid substitutions compared to Luc[T249M].

Using oligonucleotide-directed mutagenesis with 78-0B10 as the template,
codons were selected for substitution to consensus amino acids previously known
among beetle luciferases. Selections from this mutagenesis experiment were
shuffled together and three clones, determined to be the most stable, were then
used as templates for oligonucleotide mutagenesis to improve codon usage in
E. coli. A clone designated 90-1B5, selected from this experiment, contained 28
amino acid substitutions relative to Luc[T249M]. Out of 25 codons selected for
change to consensus amino acids, 11 were replaced in the clone designated
90-1B5. Only five out of the 30 positions that were selected for improved codon
usage were substituted, and these had little effect on enzyme expression.

Protein purification: The four mutants that are described herein
(Luc[T249M], 49-7C6, 78-0B10, and 90-1B5) were purified using a previously
published procedure (Hastings et al., 1996).

Enzymological characterization: Purified proteins were diluted in
25 mmol/L HEPES pH 7.8, 150 mmol/L NaCl, 0.1 mmol/L EDTA, 1 mg/mL BSA.
Enzyme stability was determined from diluted proteins incubated at different
temperatures, and aliquots were removed at different time points. A linear
regression of the natural log of the luminescence against time was calculated.
Half-life was calculated as ln(0.5)/slope of the regression.
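
The half-life calculation described above can be sketched in a few lines of Python. This is a minimal illustration under the assumption of simple first-order decay and an ordinary least-squares fit; the function name half_life and the example data are hypothetical and are not measurements from this work.

```python
import math


def half_life(times, luminescence):
    """Half-life from a least-squares fit of ln(luminescence) against time.

    times and luminescence are equal-length sequences; luminescence values
    must be positive. The result is in the same units as times.
    """
    if len(times) != len(luminescence) or len(times) < 2:
        raise ValueError("need at least two paired observations")
    logs = [math.log(v) for v in luminescence]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    slope = sxy / sxx  # negative for a decaying signal
    return math.log(0.5) / slope


if __name__ == "__main__":
    # Synthetic example: signal that halves every 2 hours.
    hours = [0.0, 1.0, 2.0, 4.0, 8.0]
    signal = [1000.0 * 0.5 ** (t / 2.0) for t in hours]
    print("estimated half-life (h):", half_life(hours, signal))
```

Fitting the logarithm rather than the raw signal turns exponential decay into a straight line, so the slope is the decay rate constant and ln(0.5)/slope converts it directly into a half-life.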

E. PCR Mutagenesis Protocol (Random Mutagenesis):
PCR mutagenesis reactions
1. Prepare plasmid DNA from a vector containing the gene of interest;
estimate DNA concentration from a gel.
2. Set up two 50 µl reactions per group:
There are three groups of mutagenic conditions using different skewed
nucleotide concentrations.
The conditions listed herein yield, after subcloning, in the range of 8-10%
colonies with the wild-type Luc phenotype for each generated parent clone. The rate
of mutagenesis is estimated by the number of luminescent colonies that are present
after mutagenesis. Based upon results of clones mutated in the range of 8-10%, it
was determined that this level of mutagenesis produces on average approximately
2-3 amino acid changes per gene. If the mutagenesis rate is selected so that on
average there is one amino acid change per gene, then on average 50% of the
clones will have no mutations. (Bowie et al., 1990)

For the master mix: add all components except polymerase, vortex, spin
briefly, add polymerase, and mix gently.

Component              A to T/T to A    A to C/T to G    G to A/C to T

dATP                   0.3 mM           0.1 mM           0.25 mM
dCTP                   2.75 mM          4 mM             1 mM
dGTP                   0.06 mM          0.02 mM          0.05 mM
dTTP                   0.625 mM         0.3 mM           0.6 mM
+pRAMtailUP            0.4 pmol/µl      0.4 pmol/µl      0.4 pmol/µl
+pRAMtailDN            0.4 pmol/µl      0.4 pmol/µl      0.4 pmol/µl
*Taq polymerase        1 U/µl           1 U/µl           1 U/µl
°MgCl2                 6.77 mM          5.12 mM          2.7 mM
°MnCl2                 0.5 mM           0.5 mM           0.3 mM
DNA                    50 ng total      50 ng total      50 ng total
10x PCR buffer         1x               1x               1x
Autoclaved nanopure    to 50 µl         to 50 µl         to 50 µl

* Taq polymerase is purchased from Perkin Elmer (N808-0101).
10x Taq polymerase buffer (aliquot the Taq into 1.5 ml tubes and store at -70°C):
- 100 mM Tris-HCl pH 8.4 from 1 M stock
- 500 mM KCl

+ Primers are diluted from a 1 nmol/µl stock to a 20 pmol/µl working stock.
pRAMtailup: 5'-gtactgagacgacgccagcccaagcttaggcctgagtg-3'
pRAMtaildn: 5'-ggcatgagcgtgaactgactgaactagcggccgccgag-3'
° MnCl2 and MgCl2 are made fresh from 1 M stocks. The stocks are filter
sterilized and mixed with sterile water to make the 10 mM and 25 mM stocks, which
are then stored in polystyrene Nalgene containers at 4°C.

Cycle in thermal cycler: 94°C for 1 min, then (94°C for 1 min, 72°C for 10 min) 10x.
3. Purify reaction products with Wizard PCR purification kit (Promega
Corporation, Madison, Wisconsin, part #A718c):
- transfer PCR reaction into a new tube containing 100 µl Promega
Direct Purification buffer (part #A724a)
- add 1 ml of Promega Wizard PCR Purification Resin (part #A718c)
and incubate at room temperature for 1 min
- pull resin through Wizard minicolumn
- wash with 80% ethanol
- spin in microcentrifuge to remove excess ethanol
- elute into 50 µl sterile nanopure water (allow water to remain on
column for at least 1 min)

Amplification of Mutagenesis Reaction (1)

1. Set up five 50 µl reactions per group:
- To master mix: add all components except polymerase, vortex,
spin briefly, add polymerase, mix gently.
° 10x reaction buffer for native PFU contains 20 mM MgCl2, so no
additional MgCl2 needs to be added
+ primers:
pRAM18up - 5'-gtactgagacgacgccag-3'
pRAM19dn - 5'-ggcatgagcgtgaactgac-3'
Cycling conditions: 94°C for 30 sec, then (94°C for 20 sec, 65°C for 1 min,
72°C for 3 min) 25x
(Perkin-Elmer GeneAmp® PCR System 2400)

2. Load 1 µl on a gel to check amplification products

3. Purify amplification reaction products with Wizard PCR purification kit
(Promega Corporation, part #A718c):
- transfer PCR reaction into a new tube containing 100 µl Direct
Purification buffer (Promega, part #A724a)
- add 1 ml of Wizard PCR Purification Resin (Promega
part #A718c) and incubate at room temperature for 1 min
- pull resin through Wizard minicolumn
- wash with 80% ethanol
- spin in microcentrifuge to remove excess ethanol
- elute with 88 µl sterile nanopure water (allow water to remain on
column for at least 1 min)

(1) This amplification step with PFU polymerase was incorporated for 2 reasons:
(a) To increase DNA yields for the production of large numbers of transformants.
(b) To reduce the amount of template DNA that is carried over from the mutagenic
PCR reaction. (Primers for the second amplification reaction are nested within the
mutagenic primers. The mutagenic primers were designed with non-specific tails
of 11 and 12 bases respectively for the upstream and downstream primers. The
nested primers will amplify DNA that was previously amplified with the
mutagenic primers, but cannot amplify pRAM template DNA.)
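
The relationship between the two primer pairs can be checked directly from the sequences given above. The short Python sketch below is an illustration only: it tests whether each amplification primer (pRAM18up, pRAM19dn) is identical to the 5' end of the corresponding mutagenic primer (pRAMtailup, pRAMtaildn), using the sequences as transcribed in this protocol. The dictionary and function names are hypothetical.

```python
# Primer sequences as listed in the protocol text (5' to 3').
TAILED_PRIMERS = {
    "up": "gtactgagacgacgccagcccaagcttaggcctgagtg",  # pRAMtailup
    "dn": "ggcatgagcgtgaactgactgaactagcggccgccgag",  # pRAMtaildn
}
AMPLIFICATION_PRIMERS = {
    "up": "gtactgagacgacgccag",   # pRAM18up
    "dn": "ggcatgagcgtgaactgac",  # pRAM19dn
}


def matches_five_prime_end(short_primer, tailed_primer):
    """True if short_primer is identical to the 5' end of tailed_primer."""
    return tailed_primer.lower().startswith(short_primer.lower())


def check_primer_pairs():
    """Return {direction: bool} for the up and down primer pairs."""
    return {
        key: matches_five_prime_end(AMPLIFICATION_PRIMERS[key], TAILED_PRIMERS[key])
        for key in TAILED_PRIMERS
    }


if __name__ == "__main__":
    print(check_primer_pairs())
    print({k: len(v) for k, v in AMPLIFICATION_PRIMERS.items()})
```

A primer that anchors on the 5' portion of the mutagenic primer will prime only on molecules that already carry that portion, that is, on products of the first PCR, which is consistent with reason (b) above.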
Subcloning of amplified PCR mutagenesis products

1. Digest the DNA with SfiI as follows:
- 2 µl SfiI (Promega part #R639a)
- 10 µl 10X buffer B (Promega part #R002a)
- 88 µl of DNA from Wizard PCR prep (see step 3 [in
amplification])
- mix components and overlay with 2 drops of mineral oil; incubate
at 50°C for 1 hour

2. Remove salts and Sfi ends with Wizard PCR purification as described
herein, and elute into 50 µl sterile nanopure water

3. Ligation into pRAM (+/r) backbone (set up 4 ligations per group):
- 0.025 pmol pRAM backbone
- 0.05 pmol insert (usually in the range of 6 to 12 µl of insert)
- 1 µl of T4 DNA Ligase (M180a)
- 2 µl of 10x ligase buffer (C126b; divide into 25 µl aliquots, do not
freeze/thaw more than twice)
- water to 20 µl
- ligate for 2 hours at room temperature
- heat reactions for 15 min at 70°C to inactivate ligase
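
The ligation amounts above are given in picomoles, whereas DNA is usually quantified by mass. The Python sketch below shows one common way to convert between the two. It is a rough illustration under stated assumptions: an average mass of about 650 g/mol per base pair of double-stranded DNA, and an insert length of 1650 bp (550 codons), neither of which is specified in the protocol. The function names are hypothetical.

```python
# Assumption: average molar mass of one base pair of double-stranded DNA.
AVG_BP_MASS_G_PER_MOL = 650.0


def pmol_to_ng(pmol, length_bp):
    """Mass in ng of `pmol` picomoles of a dsDNA fragment of `length_bp`."""
    # pmol * (g/mol) gives picograms; divide by 1000 for nanograms.
    return pmol * length_bp * AVG_BP_MASS_G_PER_MOL / 1000.0


def volume_needed_ul(pmol, length_bp, conc_ng_per_ul):
    """Volume in µl that contains `pmol` of fragment at a given concentration."""
    return pmol_to_ng(pmol, length_bp) / conc_ng_per_ul


def molar_ratio(insert_pmol, backbone_pmol):
    """Insert-to-backbone molar ratio."""
    return insert_pmol / backbone_pmol


if __name__ == "__main__":
    insert_len = 1650  # hypothetical: 550 codons x 3 bases
    print("insert mass (ng):", pmol_to_ng(0.05, insert_len))
    print("insert:backbone ratio:", molar_ratio(0.05, 0.025))
```

With these assumptions, 0.05 pmol of insert corresponds to a few tens of nanograms, so the 6 to 12 µl volumes mentioned above would imply an eluate of only a few ng/µl.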

Transformation and plating

1. Butanol precipitate samples to remove excess salts (n-butanol from
Sigma, St. Louis, Missouri, part #BT-105):
(if ethanol precipitation is used instead of butanol, wash with 70%
ethanol as needed) (excess salt will cause arcing during the electroporation, which
causes the reaction to fail)
- add water to 50 µl
- add 500 µl of n-butanol
- mix until butanol/ligation mix is clear and then spin for 20 min at
room temperature
- drain butanol into waste container in fume hood
- resuspend in 12 µl water, spin 30 sec at full speed

2. Preparation of cell/DNA mix (set up 4 transformations plus one with
reference clone DNA):
- while DNA is precipitating, place electroporation cuvettes on ice
- fill 15 ml Falcon snap-cap tubes with 3 ml SOC medium and
place on ice
- thaw JM109 electrocompetent cells on ice (50 µl per ligation
reaction)
- pipette 10 µl of the bottom layer from step 1 (or 0.5 µl ref clone
DNA) into competent cells
(small amounts of butanol carry-over do not adversely affect the
transformation efficiency)
- place cell/DNA mix on ice

3. Electroporation:
- carry tubes, cuvettes, and cell/DNA mix on ice to electroporation
device
- pipette cell/DNA mix into a cuvette and zap. Instrument settings:
Cuvette gap: 0.2 cm
Voltage: 2.5 kV
Capacitance: 25 µF
Resistance: 200 Ohms
Time constant: 4.5 msec
- pipette 1 ml SOC (contains KCl; media prep #KCLM) into
cuvette, quickly pour into recovery tube (transformation efficiency is reduced if
cells are allowed to sit in cuvette)
- place the recovery tube on ice until all samples are processed
- allow the cells to recover at 37°C for 30-60 min
- plate on LB+amp plates with nitrocellulose filters
(# of colonies is ~20% higher if cells recover 60 min, possibly due to cell
replication. See 101305 p.65)




WO 99/14336 



PCT/US98/19494 



(Best colony density for screening is 500 per plate. For the current batch of
cells plate ~500 to 750 µl)
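
The instrument settings listed in the electroporation step can be related to two derived quantities: the field strength across the cuvette and the nominal RC time constant. The Python sketch below is a simple illustration of that arithmetic; the function names are hypothetical, and the comparison with the listed 4.5 msec assumes an idealized RC discharge, which a real sample only approximates.

```python
def field_strength_kv_per_cm(voltage_kv, gap_cm):
    """Electric field across the cuvette gap, in kV/cm."""
    return voltage_kv / gap_cm


def rc_time_constant_ms(resistance_ohm, capacitance_uf):
    """Nominal RC time constant in milliseconds.

    ohm x microfarad gives microseconds; divide by 1000 for milliseconds.
    """
    return resistance_ohm * capacitance_uf / 1000.0


if __name__ == "__main__":
    print("field strength (kV/cm):", field_strength_kv_per_cm(2.5, 0.2))
    print("nominal time constant (ms):", rc_time_constant_ms(200.0, 25.0))
```

The nominal value from the resistor and capacitor settings is slightly above the 4.5 msec listed, which is the direction expected when the sample itself conducts a little; a much shorter observed time constant would suggest excess salt in the sample.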

F. Recombination Mutagenesis Protocol or DNA shuffling: 

DNase I digestion of plasmid DNA
1. Prepare 2% low melting point gel
- use 0.8 g agarose in 40 ml (NuSieve #50082)
- use large prep comb
- make sure it is solidified prior to digesting
2. Prepare 4 µg of pooled plasmid DNA for digest
3. Prepare 1 U/µl DNase dilution on ice according to the table below:

DNase I+              0.74 µl
10x DNase I buffer    10 µl
1% gelatin*           10 µl
Water                 to 100 µl

+ DNase I from Sigma (D5791)
* Gelatin was added to keep the DNase I from sticking to the walls of the tubes.
This dilution can be kept on ice for at least 30 min without loss in activity.

4. Digest (set up at room temperature):
prepare two digests with 1.0 U and 1.5 U DNase I per 100 µl reaction:
- 10 µl of 10x DNase I buffer (500 mM Tris, 10 mM MgCl2 pH 7.8)
- x µl DNA (2 µg of pooled plasmid DNA from step 2)
- 1 or 1.5 µl of the 1 U/µl enzyme dilution
- sterile nanopure water to 100 µl
- incubate at room temperature for 10 minutes
- stop reaction with 1 µl of 100 mM CDTA



Purification from agarose gel

1. Run DNase digested fragments on gel
- add 10 µl of 10x blue juice to each DNase I digest
- load all on a 2% low melting point agarose gel
- run about 30 min at 120-150 V
- load pGEM DNA marker in middle lane

2. Isolate fragments
- cut out agarose slice containing fragments in the size range of
600-1000 bp using a razor blade
- cut into pieces that weigh ~0.3 g
- melt the gel slices at 70°C
- add 300 µl of phenol (NaCl/Tris equilibrated) to the melted
agarose, vortex for ~1 min at max speed
- spin for 10 min at 4°C (the interface is less likely to move around
if it is done at 4°C)
- remove the top layer into a tube containing an equal volume of
phenol/chloroform/isoamyl (saturated with 300 mM NaCl/100 mM Tris pH 8.0),
vortex and spin for 5 min at RT
- remove the top layer into a tube containing chloroform and vortex
and spin
- remove the top layer into a tube with 2 vol. of 95% cold ethanol;
place in -70°C freezer for 10 min (no additional salts are needed because of the
high salt phenol)
- spin at 4°C for 15 minutes
- wash with 70% ethanol, drain and air dry for ~10 min
- resuspend in 25 to 50 µl of sterile nanopure water
- store at -70°C until ready for use

Assembly reaction

Set up 4 reactions and pool when completed

Component            Concentration   Amount in µl   Final concentration
dATP                 10 mM           1              200 µM
dCTP                 10 mM           1              200 µM
dGTP                 10 mM           1              200 µM
dTTP                 10 mM           1              200 µM
DNA*                                 5
Tli                  3 U/µl          0.4            0.24 U/µl
10X Thermo buffer    10X             5              1X
MgCl2                25 mM           4              2 mM
gelatin              1%              5              0.1%
water                                to 50 µl

* Because the DNA used for this reaction has been fragmented, it is
difficult to estimate a concentration. The easiest way is to load 5 µl of the DNase I
digested DNA to an agarose gel and run the gel until the dye enters the wells (1-2
min). Fragments from a typical 2 µg DNA digest which were resuspended in 100
µl of water will give a DNA concentration of ~1 to 10 ng/µl. See 101284 p.30 for
a photo of this type of gel.
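
The final-concentration column of the assembly table follows from the dilution rule C1 x V1 = C2 x V2 applied to a 50 µl reaction. The sketch below reproduces that arithmetic; the function and variable names are illustrative only and are not part of the protocol. Note that for Tli the same arithmetic gives 0.024 U/µl, whereas the table lists 0.24 U/µl; the sketch simply reports the computed value.

```python
def final_concentration(stock, volume_ul, total_ul=50.0):
    """Dilution rule C1*V1 = C2*V2, solved for the final concentration C2."""
    return stock * volume_ul / total_ul


# (component, stock concentration, unit, volume added in ul), taken from the table.
# Names and layout are hypothetical; only the numbers come from the protocol.
ASSEMBLY_MIX = [
    ("dATP", 10000.0, "uM", 1.0),  # 10 mM stock expressed in uM
    ("dCTP", 10000.0, "uM", 1.0),
    ("dGTP", 10000.0, "uM", 1.0),
    ("dTTP", 10000.0, "uM", 1.0),
    ("Tli", 3.0, "U/ul", 0.4),
    ("MgCl2", 25.0, "mM", 4.0),
    ("gelatin", 1.0, "%", 5.0),
]


def summarize(mix, total_ul=50.0):
    """Map each component name to (final concentration, unit)."""
    return {
        name: (round(final_concentration(stock, vol, total_ul), 4), unit)
        for name, stock, unit, vol in mix
    }


if __name__ == "__main__":
    for name, (conc, unit) in summarize(ASSEMBLY_MIX).items():
        print(f"{name}: {conc} {unit}")
```

The same helper can be applied to the amplification mix by swapping in its stock concentrations and volumes.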

Cycling conditions: 94°C-30 sec [94°C-20 sec, 65°C-1 min, 72°C-2 min] 25x (Program
"assembly-65", runs ~2.5 h)

Amplification of assembly

Usually 5 amplification reactions will produce enough DNA for a full 8
plate robotic run

Component                 Concentration   Amount in µl   Final concentration
dATP                      10 mM           1              200 µM
dCTP                      10 mM           1              200 µM
dGTP                      10 mM           1              200 µM
dTTP                      10 mM           1              200 µM
pRAMtailup*               20 pmol/µl      2              0.8 pmol/µl
pRAMtaildn*               20 pmol/µl      2              0.8 pmol/µl
PFU native polymerase+    2 U/µl          1              0.04 U/µl
10x native PFU buffer°    10x             5              1x
DNA                                       5
water                                     to 50 µl

* Note that the concentration of primers is twice as high as in a typical
amplification reaction.

° The PFU 10X buffer contains 20 mM MgCl2, so it is not necessary to add
MgCl2.

+ PFU is ordered from Stratagene part #600135.

Cycling conditions: 94°C-30 sec [94°C-20 sec, 65°C-1 min, 72°C-3 min] 25x
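
As a rough planning aid, the programmed hold time of each cycling program can be added up as sketched below. This is only an estimate under a stated assumption: ramping between temperatures is ignored, which is presumably why the assembly program is reported to run about 2.5 h although its holds sum to less. The function name is illustrative.

```python
def hold_time_seconds(initial_s, cycle_steps_s, cycles):
    """Sum of programmed hold times; ramp time between steps is ignored (assumption)."""
    return initial_s + cycles * sum(cycle_steps_s)


# Assembly: 30 sec, then 25 x [20 sec, 1 min, 2 min]
assembly_s = hold_time_seconds(30, [20, 60, 120], 25)

# Amplification: 30 sec, then 25 x [20 sec, 1 min, 3 min]
amplification_s = hold_time_seconds(30, [20, 60, 180], 25)

if __name__ == "__main__":
    print(f"assembly holds: {assembly_s / 60:.1f} min")
    print(f"amplification holds: {amplification_s / 60:.1f} min")
```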

Subcloning of assembly amplification

1. Purify amplification products with Wizard PCR purification:
   - pool 5 amplification reactions
   - transfer into a new tube that contains 100 µl of Direct Purification buffer
   - add 1 ml of Wizard PCR Purification Resin, incubate at RT for 1 min
   - pull Resin through Wizard minicolumn
   - wash with 80% ethanol and spin in microcentrifuge to remove excess ethanol
   - elute with 88 µl of sterile nanopure water (allow water to remain
     on column for at least 1 min)

2. Digest with SfiI:
   - 2 µl SfiI
   - 10 µl 10x buffer B
   - 88 µl of DNA from Wizard PCR prep
   - mix components and overlay with 2 drops of mineral oil; incubate
     at 50°C for 1 hour

3. Band isolation:

Sometimes after amplification of the assembly reaction a band that is
smaller than the gene-sized fragment is produced. This small fragment has been
shown to subclone about 10-fold more frequently than the gene-sized fragment if
the sample is not band isolated. When this contaminating band is present, it is
necessary to band isolate after SfiI digestion.

   - load the DNA to a 0.7% agarose gel
   - band isolate and purify with the Gene Clean kit from Bio 101
   - elute DNA with 50 µl sterile nanopure water, check concentration
     on gel (This type of purification with standard agarose produced the highest
     number of transformants after subcloning. Other methods tried: Low melt with
     Phenol chloroform, Gene clean with low melt, Wizard PCR resin with standard
     agarose, Pierce Xtreme spin column with Low melt (did not work with standard
     agarose)).

4. Ligate into pRAM [+/r] backbone: (See ligation and transformation
protocol above)

Large scale preparation of pRAM backbone

1. Streak an LB amp plate with pRAMMCS [+/r]. (This vector contains a
   synthetic insert with a SacII site in place of a gene. It can be found at
   -70°C in the box listed pRAM glycerol stocks, position b2. This vector
   contains the new ribosome binding site, but it will be cut out when the
   vector is digested with SfiI.)

2. Prepare a 10 ml overnight culture in LB supplemented with amp.

3. The next day inoculate 1 L of LB supplemented with amp and grow for
   16-20 hours.

4. Purify the DNA with the Wizard Maxi Prep kit (use 4 preps for 1 L of
   cells).

5. Digest the plasmid with SfiI (use 5 U per microgram). Overlay with
   mineral oil and digest for at least two hours.

6. Ethanol precipitate to remove salts. Resuspend in water.

7. Digest with SacII for 2 hours (keep digest volume to 2 ml or less).
   It is possible that part of the plasmid could be partially digested. If the
   vector is cut with an enzyme that is internal to the two SfiI sites, it will
   keep the partially digested fragments from joining in a ligation
   reaction.

8. Load entire digest onto a column (see 9). The volume of the sample
   load should not be more than 2 ml. If it is, it will be necessary to
   ethanol precipitate.

9. The column contains Sephacryl S-1000 and is stored with 20% ethanol
   to prevent bacterial contamination. Prior to loading the sample the
   column must be equilibrated with cold running buffer for at least 24
   hours. If the column has been sitting more than a couple of months it
   may be necessary to empty the column, equilibrate the resin with 3-4
   washes in cold running buffer, and then re-pour the column. After the
   column is poured it should be equilibrated overnight so that the resin
   is completely packed.

10. Collect fractions of ~0.5 ml. Typically the DNA comes off between
    fractions 25 and 50. Load a five µl aliquot from a range of fractions
    to determine which fractions contain the backbone fragment. The
    small insert fragment will start to come off the column before all of
    the backbone is eluted, so it will be necessary to be conservative when
    fractions are pooled. For this reason typically 40-60% of the DNA is
    lost at this step.

11. Pool the fractions that contain the backbone.

12. Ethanol precipitate the samples. Resuspend in a volume that produces
    ~10-50 ng/µl.

13. Store at -70°C.

Column running buffer: (store at 4°C)
5 mM EDTA
100 mM NaCl
50 mM Tris-HCl pH 8.0
10 ng/ml tRNA (R-8759)

H. Oligonucleotide Mutagenesis:

Prepare Ampicillin-sensitive single stranded DNA of the template to be mutated.
Design a mutagenic primer that will randomly generate all possible amino acid
codons.
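
One way to picture such a primer is as a stretch of template sequence in which the three bases of the target codon are replaced by fully degenerate positions (NNN), so that the synthesized oligo pool contains all 64 codons at that site. The sketch below illustrates this under stated assumptions: the NNN scheme, the 12-base flanks, the function names, and the choice of codon are all hypothetical and not specified by the protocol, and the example input is simply the first 45 nucleotides following the ATG in the first sequence of the listing. Strand orientation relative to the single-stranded template is not handled.

```python
from itertools import product


def degenerate_primer(coding_seq, codon_index, flank=12):
    """Return a primer with NNN at the chosen codon (0-based), flanked by template bases."""
    start = 3 * codon_index
    end = start + 3
    if start - flank < 0 or end + flank > len(coding_seq):
        raise ValueError("not enough flanking sequence around the target codon")
    return coding_seq[start - flank:start] + "NNN" + coding_seq[end:end + flank]


def expand(primer):
    """List every concrete sequence encoded by a primer containing N positions."""
    choices = ["ACGT" if base == "N" else base for base in primer]
    return ["".join(combo) for combo in product(*choices)]


# Example input only: 15 codons of coding sequence.
orf = "ATGGAAGATAAAAATATTTTATATGGACCTGAACCATTTTATCCC"

primer = degenerate_primer(orf, codon_index=7)   # randomize the 8th codon
variants = expand(primer)                         # every oligo in the pool
codons_sampled = {v[12:15] for v in variants}     # codons present at the target site

if __name__ == "__main__":
    print(primer)
    print(len(variants), "oligos,", len(codons_sampled), "distinct codons")
```

Restricted schemes (for example NNK) reduce the number of stop codons in the pool; whether one was used here is not stated.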

Mutagenesis reaction:

Component                                   Final concentration
Single Stranded Template                    0.05 pmol
Mutagenic Oligo                             1.25 pmol
Ampicillin Repair Oligo (Promega q631a)     0.25 pmol
10X annealing buffer*                       1X
Water                                       to 20 µl

*Annealing buffer:
- 200 mM Tris-HCl, pH 7.5
- 100 mM MgCl2
- 500 mM NaCl

Heat reaction at 60°C for 15 minutes and then immediately place on ice.

Synthesis reaction:

Component                                Amount
Water                                    5 µl
10X synthesis buffer*                    3 µl
T4 DNA Polymerase (Promega m421a)        1 µl (10 Units)
T4 DNA Ligase (Promega 180a)             1 µl (3 Units)

*Synthesis buffer:
- 100 mM Tris-HCl, pH 7.5
- 5 mM dNTPs
- 10 mM ATP
- 20 mM DTT

Incubate at 37°C for 90 minutes.

Transform into Mut-S strain BMH 71-18 (Promega strain Q6321)
- Place synthesis reaction in a 17 x 100 mm tube.
- Add BMH 71-18 competent cells that have been thawed on ice to
  synthesis reaction.
- Incubate on ice for 30 min.
- Heat shock cells at 42°C for 90 seconds.
- Add 4 ml of LB medium and grow cells at 37°C for 1 hour. Add
  Ampicillin to a final concentration of 1.25 µg/ml and then grow overnight at 37°C.

Isolate DNA with Wizard Plus Purification system (Promega a7100).

Transform isolated DNA into JM109 electro-competent cells and plate
onto LB Ampicillin plates.

I. Screening procedure:

JM109 clones (from a transformation reaction) are plated onto
nitrocellulose filters placed on LB amp plates at a screening density of ~500
colonies per plate.

As listed in the Random Mutagenesis procedure, approximately 10% of the
clones will be at least as stable as the source sequence and therefore
suitable for selection. Stated another way, ~50 colonies per plate will be
suitable for selection. There are 704 wells available for a full eight plate
robotic run, so at least 15 LB amp plates will be needed for a full robotic run.
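
The plate count quoted above is a simple ceiling calculation: 704 wells divided by roughly 50 selectable colonies per plate. A minimal sketch of that arithmetic follows; the function name and default arguments are illustrative and merely restate the figures in the text.

```python
import math


def plates_needed(wells=704, colonies_per_plate=500, fraction_selectable=0.10):
    """Number of LB amp plates needed to fill every well of a robotic run."""
    selectable_per_plate = colonies_per_plate * fraction_selectable
    return math.ceil(wells / selectable_per_plate)


if __name__ == "__main__":
    print(plates_needed())
```

Lowering the assumed selectable fraction raises the plate count proportionally, which is useful when a library turns out to be less stable than expected.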

After overnight growth at 37°C the plates containing the transformants are
removed from the incubator and placed at room temperature.

The nitrocellulose filter is lifted on one side and 500 µl of 10 mM IPTG is
added to each of the plates. The filter is then placed back onto the plate to allow
diffusion of the IPTG into the colonies containing the different mutant luciferase
genes. The plates are then incubated for about 4 hours at room temperature.

One (1) ml of a solution containing 1 mM luciferin and 100 mM sodium
citrate is pipetted onto a slide warmer that is set at 50°C. A nitrocellulose filter
that contains mutant luciferase colonies and has been treated with IPTG is then
placed on top of the luciferin solution. After several minutes, the brightest
colonies are picked with toothpicks, which are used to inoculate wells in a
microtiter plate that contain M9 minimal media with 1% gelatin.

After enough colonies are picked to fill 8 microtiter plates, the plates are
placed in an incubator at 350 rpm at 30°C and are grown overnight.

In the morning the overnight plates are loaded onto the robot and the cell
dilution procedure is run. (This procedure dilutes the cultures 1:10 into induction
medium.) The new plates are grown for 3 hours at 350 rpm at 30°C.

After growth, the plates are loaded onto the robot for the main assay
procedure.

Minimal Media:

6 g/Liter Na2HPO4
3 g/Liter KH2PO4
0.5 g/Liter NaCl
1 g/Liter NH4Cl
2 mM MgSO4
0.1 mM
1 mM Thiamine-HCl
0.2% glucose
12 µg/ml Tetracycline
100 µg/ml ampicillin

*Overnight media contains 1% gelatin
*Induction media contains 1 mM IPTG and no gelatin.
S.O.C. Media

- 10 mM NaCl
- 2.5 mM KCl
- 20 mM MgCl2
- 20 mM glucose
- 2% bactotryptone
- 0.5% yeast extract

TABLE 1: Parameters Characterizing Luciferases of Clones Derived for
Various Experiments

Control is PPE-2 39-5B10 at 51°C.

Experiment   Clone ID   Li      tau      Km       S
40           0a7        1.04    4.5      0.78     1
40           5h4        1.29    1.61     1.16     0.953
40           0c2        1.13    1.54     0.91     0.998
40           5g4        1       1.4      0.85     1
40           6d3        1.02    1.37     0.79     1
40           1g4        1.06    1.28     0.77     (illegible)
40           1d4        1.69    1.23     0.73     1
40           0h9        1.26    1.21     0.63     0.998
40           2f6        3       1.07     0.49     0.981
40           7d6        3.09    1.058    1.09     1.013
40           5a7        4.3     1.025    0.93     1.008
40           4c8        1       1        0.33     1.004
41           7h7        0.73    2.4      2.1      0.995
41           5a5        0.77    1.93     2.7      1.002
41           2c12       1.06    1.7      0.91     1.003
41           6e5-       1.16    1.62     1.53     0.997
41           4e5-       1.08    1.37     1.4      1.004
41           6g7        1.3     1.27     1.39     0.999
41           1h4        1.36    1.24     0.56     0.994
41           0c11       4.1     1.23     1.24     0.996
41           2h9        5.3     1.01     0.83     0.986
42           6b10       0.97    3.6      0.97     0.997
42           1c3        0.91    2.1      0.6      0.998
42           7h9        0.8     1.8      0.8      0.982
42           6b2        0.77    1.72     0.8      0.978
42           6d6        0.83    1.7      0.733    0.975
42           4e10-      0.77    1.63     1.8      0.954
42           1b5        0.83    1.41     1.05     0.955
42           6e8-       0.71    1.16     0.89     0.955
42           5a9        0.85    1.3      0.86     0.997
42           6b6        2.7     1.3      0.91     1.02
42           6e9-       1.5     1.27     0.98     1.01
42           3h11       1.73    1.21     0.63     0.985
42           1a2        1.11    1.17     0.77     1.005
42           3f7        0.49    1.16     1.13     0.944
42           1a4        2       1.01     0.76     0.996

Control is PPE-2 40-0A7 at 54°C.

Experiment   Clone ID   Li      tau      Km       S
46           2h3        0.86    6.4      0.37     0.96
46           4a9        0.67    5.7      0.66     0.997
46           2g4        0.65    5.3      0.78     0.96
46           5d12       0.94    4.9      0.94     1.002
46           1h11       1.02    4.8      0.84     0.998
46           5a10       1.23    4.4      0.81     0.9842
46           0a8        1.35    4.3      0.89     1
46           4d3        0.51    3.6      0.65     0.975
46           2a3        1.17    2.9      0.57     0.988
46           3b11       1.39    2.5      0.63     1.02
46           7g12       1.49    2.5      0.91     1.02
46           0g9        1.86    2.25     0.5      0.998
46           7h8        1.07    1.36     0.52     0.99
46           1g8        0.3     1.31     0.72     0.92
46           1d3        1.74    1.13     1.02     1.001
46           0c3        1.68    1.01     0.74     1.01
46           5c11       0.82    1.01     0.6      0.95

Control is PPE-2 46-2h3 at 54°C.

Experiment   Clone ID   Li      tau      Km       S
49           6c10       0.57    2.2      0.98     1
49           7c6        1.12    1.9      0.93     1.01
49           0g12       1       1.58     0.69     1.08
49           7a5        1.08    1.44     1.1      0.99
49           1f6        0.66    1.13     1.04     1.006
49           0b5        0.76    1.07     1.03     0.98
49           4a3        0.94    1.06     0.77     1

Control is PPE-2 49-7C6 at 56°C.

Experiment   Clone ID   Li      tau      Km       S
56           2d12       0.97    2.9      0.29     1.006
56           5g10       1.01    2.77     0.64     1.007
56           3d5        1.32    2.25     1.85     1.03
57           3d1        1.06    2.9      1.05     1.02
57           6g12       1       2.7      0.87     1.004
57           4c1        0.79    2.6      0.93     1.014
57           5f10               1.9      0.64     1.03
57           1e6-       0.84    1.49     0.984    0.9871
57

PPE-2 78-
TABLE 2: Stability Of Luciferase Activity At Different Temperatures (Half-
Life In Hours)

              Room Temperature   37°C    50°C    60°C
Luc[T249M]    110                0.59    0.01
49-7C6        430                68      31      6.3
78-0B10       3000               220     47      15
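
To relate the half-lives in Table 2 to the fraction of activity left after a given incubation, one can assume simple first-order (exponential) decay. That assumption is ours for illustration; the table itself reports only half-lives. Under it, the fraction remaining after t hours is 0.5^(t / half-life). The names below are hypothetical.

```python
# Half-lives in hours, copied from Table 2 (keys are illustrative labels).
HALF_LIFE_HOURS = {
    "Luc[T249M]": {"RT": 110, "37C": 0.59, "50C": 0.01},
    "49-7C6": {"RT": 430, "37C": 68, "50C": 31, "60C": 6.3},
    "78-0B10": {"RT": 3000, "37C": 220, "50C": 47, "60C": 15},
}


def fraction_remaining(half_life_h, elapsed_h):
    """Fraction of activity left after elapsed_h hours, assuming first-order decay."""
    return 0.5 ** (elapsed_h / half_life_h)


def percent_lost(enzyme, condition, elapsed_h):
    """Percent of activity lost for one enzyme and temperature after elapsed_h hours."""
    half_life = HALF_LIFE_HOURS[enzyme][condition]
    return 100.0 * (1.0 - fraction_remaining(half_life, elapsed_h))


if __name__ == "__main__":
    for enzyme in ("49-7C6", "78-0B10"):
        print(enzyme, f"{percent_lost(enzyme, '50C', 2):.1f}% lost after 2 h at 50C")
```

Under this assumption a 47 h half-life corresponds to a loss of about 3% after 2 hours at 50°C.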
TABLE 3: Michaelis-Menten Constants for Mutants Created by Directed
Evolution

              Km-luciferin   Km-ATP
Luc[T249M]    0.32 µM        18 µM
49-7C6        0.99 µM        14 µM
78-0B10       1.6 µM         3.4 µM
90-1B5        2.2 µM         3.0 µM
TABLE 4:

Components    Concentration   Amount in 50 µl          Final concentration
dATP          10 mM           1                        0.2 mM
dCTP          10 mM                                    0.2 mM
dGTP          10 mM                                    0.2 mM
dTTP          10 mM                                    0.2 mM
+pRAM18up     20 pmol/µl                               0.4 pmol/µl
+pRAM19dn     20 pmol/µl                               0.4 pmol/µl
PFU           2 U/µl                                   0.04 U/µl
°10x buffer   10x             5                        1x
DNA                           10 (from purified wiz.)
Water                         24.6
TABLE 5:

Summary of Evolutionary Progression

1. Start with LucPpe2[T249M]
2. Mutate 3 amino acids at N- and C-termini
3. Mutate 7 cysteines
4. Perform two iterations of evolution -> Luc49-7C6
5. Mutagenesis of altered codons (9)
6. Two iterations of evolution -> Luc78-0B10
7. Mutagenesis of consensus codons (28)
8. Mutagenesis of codon usage (24) -> Luc90-1B5
TABLE 6: One Iteration of Recursive Process

1. 1 clone -> 3 libraries using error-prone PCR
   • 3 x Visual screen (~10,000 clones each)
   • 3 x Quantitative screen (704 clones each)
2. 3 x 18 clones -> library using sPCR
   • Visual screen (~10,000 clones)
   • Quantitative screen (704 clones)
3. 18 + 18 -> library using sPCR
   • Visual screen (~10,000 clones)
   • Quantitative screen (704 clones)
4. Output: 18 clones
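
The screening load implied by Table 6 can be tallied as sketched below. The data structure and names are illustrative; the counts are the approximate figures from the table (about 10,000 clones per visual screen, 704 per quantitative screen).

```python
# (step, number of libraries, clones per visual screen, clones per quantitative screen)
ITERATION = [
    ("error-prone PCR from 1 clone", 3, 10000, 704),
    ("sPCR of 3 x 18 clones", 1, 10000, 704),
    ("sPCR of 18 + 18 clones", 1, 10000, 704),
]


def totals(steps):
    """Return (clones screened visually, clones screened quantitatively) per iteration."""
    visual = sum(libs * vis for _name, libs, vis, _quant in steps)
    quantitative = sum(libs * quant for _name, libs, _vis, quant in steps)
    return visual, quantitative


if __name__ == "__main__":
    visual, quantitative = totals(ITERATION)
    print(f"visual: ~{visual} clones, quantitative: {quantitative} clones")
```

On these figures one iteration screens roughly 50,000 clones visually and 3,520 quantitatively to yield 18 output clones.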
WE CLAIM:

1. A second beetle luciferase with increased thermostability as
compared with a first luciferase, said second luciferase made by the following
method:

a) mutating a polynucleotide sequence encoding the first
luciferase to obtain a polynucleotide sequence encoding the second luciferase;

b) selecting the second luciferase if a plurality of characteristics
including thermostability of a luciferase is in a preferred range.

2. The second luciferase of claim 1, wherein the polynucleotide
sequence encoding the first luciferase is the same as the sequence of Luc
(T249M).

3. The second luciferase of claim 1, wherein thermostability is at least
2 hours at about 50°C in aqueous solution.

4. The second luciferase of claim 3, wherein thermostability is at least
5 hours at 50°C in aqueous solution.

5. The second luciferase of claim 1, wherein the plurality of
characteristics comprises brightness of luminescence, substrate utilization and
luminescence signal.

6. The second luciferase of claim 1, wherein the mutating is by
directed evolution.

7. A beetle luciferase that is thermostable for at least 2 hours at 50°C
in aqueous solution.

8. The luciferase of claim 7, that is thermostable for at least 5 hours at
50°C.

9. The luciferase of claim 7, wherein less than 5% luminescence
activity is lost after incubation in solution for 2 hours at about 50°C.

10. A method for preparing a beetle luciferase with increased
thermostability, said method comprising the following steps:

a) mutating a polynucleotide sequence encoding a first
luciferase to obtain a sequence encoding a second luciferase; and

b) selecting the second luciferase if a plurality of characteristics
including thermostability of a luciferase are in a preferred range.

11. The method of claim 10, wherein thermostability is at least 2 hours
at 50°C.

12. The method of claim 11, wherein the thermostability is at least 5
hours at 50°C.

13. The method of claim 10, wherein mutating occurs at at least one
position wherein a consensus amino acid is present in beetle species.

14. The method of claim 10, wherein mutating occurs at at least one
position where a mutation occurred to produce the luciferase gene designated
luc90-1B5.

15. A DNA molecule having a nucleotide sequence that encodes a
mutant luciferase with increased thermostability as compared to the
thermostability of a native luciferase.

16. The DNA molecule of claim 15, wherein the nucleotide sequence is
selected from the group consisting of sequences.
GGAT CCAATGGAAGAT AAAAATATTTTATATGGACCTGAACCATT TTATC CC TT GGCT GA
TGGGAC GGCTGGAGAACAGATGTTTTACGC ATTATC TCGTTATGCAGATATTTCAGGATG 
CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 
GTC GTGTC GT TTAGCGGAAAGT TTTAAAAAGTAT GGATTAAAAC AAAACGAC AC AATAGC 
GGTGTGTAGC GAAAATGGTTTGCAATTTTTCCTTCC TATAATTGCATCATTGTATC TTGG 
AATAATTGCAGCAC CTGT TAGT GATAAATACATTGAACGTGAAT TAAT AC AC AGTCTT GG 
TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTTTCAAAAAGTACTGAATGT 
AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 
AGGTTATC AATGCC TC AACAAC TT TATT TC TCAAAATTCC GATATTAATC TT GACGTAAA 
AAAATT TAAAC C ATAT TC TT TT AATC GAGAC G ATCAGGTT GC GT T GGT AATGTT TT C T TC 
T GGTACAACTGGTGTTTC GAAGGGAGTCATGCTAACTCACAAGAATATTGTTGC ACGATT 
T T CT C TTGCAAAAGAT CC TAC TTTT GGTAAC GCAATTAATCC AAC GAC AGCAAT TTTAAC 
GGTAATAC CTTTCCAC CATGGTTTTGGTATGATGACCACATTAGGATACTTTAC TTGT GG 
AT TC C GAGTTGTTC TAAT GC ACAC GTTTGAAGAAAAACTATTTC TACAAT CATTAC AAGA 
T T AT AAAGTGGAAAGT AC TT TACTTGTAC C AACATT AATGGC ATT TC TTGCAAAAAGTGC 
ATTAGTTGAAAAGTAC GATT TATC GC AC TTAAAAGAAATT GCAT C TGGTGGC GC AC C T TT 
ATC AAAAGAAAT TGGGGAGAT GGT GAAAAAACGGTT TAAATT AAAC TT TGT C AGGC AAGG 
GTATGGATTAAC AGAAAC CACTTC GGCTGTTTTAATTACACCGAACAATGAC GTCAGACC 
GGGATC AAC TGGTAAAAT AGTAC CATTT CAC GCT GTTAAAGTTGTCGAT CC T AC AACAGG 
AAAAATTTTGGGGCCAAATGAAACTGGAGAATTGTATTTTAAAGGCGACATGATAATGAA 
AGGT TATT AT AAT AAT GAAGAAGC TAC T AAAGCAATTAT TAACAAAGAC GGATGGTTGCG 
C TCT GGTGAT AT T GCT TATT AT GACAAT GATGGC CATTTTTATAT TGT GGAC AGGCTGAA 
GTCATT AATT AAATAT AAAGGTTATC AGGTTGCACC TGC TGAAAT TGAGGGAAT AC TC TT 
ACAACATCCGTATATTGTTGATGC CGGC GT TACTGGTATACC GGATGAAGC CGC GGGCGA 
GCTTCC AGCT GCAGGTGT TGTAGTAC AGACTGGAAAATATCTAAACGAAC AAATCGTACA 
AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 
GGATGAAATTCC CAAAGGATCAAC TGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 
T GAAAAAC AC ACCAATGGG * 



GGATC C AATGGAAGATAAAAAT AT TT TATATGGACC TGAACC ATTTTATC C C TTGGCTGA 
T GGGAC GGC T GGAGAACAGATGTT TT AC GC ATTATC TCGT TATGCAGATATTTC AGGATG 
CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTGTTAAAATT
GTC GTGTC GTTTAGCGGAAAGT TT TAAAAAGTAT GGATTAAAAC AAAAC GACAC AATAGC 
GGTGTGTAGC GAAAAT GGTTTGCAATTT TT C C TT CC TATAAT TGC ATCAT TGTATCTTGG 
AATAATTGCAGCAC CT GTTAGT GATAAAT ACATT GAAC GT GAATT AAT AC AC AGT CTTGG 
TATTGT AAAAC CAC GCAT AATT TT TT GC TC CAAGAATAC T TT TC AAAAAGTACT GAATGT 
AAAATC TAAATT AAAATATGTAGAAAC TAT TATTAT AT TAGACT T AAAT GAAGAC TT AGG 
AGGT TAT C AATGC C TC AACAAC TT TATT TC TCAAAATTCC GATATTAATC T GGAC GTAAA 
AAAATT TAAAC C ATAT TC TTTT AATC GAGACGAT CAGGTT GC GT TGGT AATGTT TTC T TC 
T G GT ACAAC TGGTGTT TC GAAGGGAGTC AT GC TAACTC AC AAGAATAT TGT T GC AC GATT 
TT CT CATGCAAAAGATC C TACT TTTGGTAACGCAAT TAAT C C AAC GAC AGCAAT TT TAAC 
GGTAATAC CTTTCCAC CATGGTTTTGGTATGATGACCACATTAGGATACTTTAC TTGTGG 
ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGA 
TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTTTTGCAAAAAGTGC 
ATTAGT T GAAAAGT AC GAT TTATC GC AC TTAAAAGAAATT GCAT C T GGTGGC GC AC CTTT 
AT CAAAAGAAATTGGGGAGATGGT GAAAAAAC GGTT TAAATT AAAC TT TGTC AGGC AAGG 
GT AT GGATTAACAGAAAC CACTTC GGCTGTTTTAAT TACACC GAACAATGAC GTCAGACC 
GGGATC AACT GGTAAAATAGTACC ATTTCAC GC TGT TAAAGT TGTC GATC CTACAACAGG 
AAAAATTTTGGGGCCAAATGAAACTGGAGAATTGTATTTTAAAGGCGACATGATAATGAA 
AGGT TATT ATAATAATGAAGAAGC TACT AAAGCAATTATTAACAAAGACGGATGGT TGCG 
CTCT GGTGAT ATTGC TTATTATGACAAT GATGGC CATT TTTATATTGT GGAC AGGC TGAA 
GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 
AC AACATCCGTATATTGTTGATGC CGGC GTTACTGGTATACCGGATGAAGCC GC GGGCGA 
GC TTCC AGCT GCAGGT GT TGTAGT AC AGAC T GGAAAAT ATCTAAACGAAC AAAT CGTACA 
AAAT TTTGTTTC CAGTCAAGTTTC AACAGCCAAATGGC TACGTGGTGGGGTGAAATTTTT 
GGAT GAAATT C C CAAAGGAT CAAC TGGAAAAATT GACAGAAAAGT GTT AAGACAAATGTT 
T GAAAAAC AC ACCAATGGG* 
GGAT CC AAT GGAAGATAAAAATATTT TAT ATGGAC C TGAAC C ATTT TATC CC TTGGCT GA
T GGGAC GGC T GGAGAACAGATGTTTT AC GC ATTATC TC GTTATGC AGATATTTC AGGATG 
C ATAGC AT TGAC AAATGC TC ATAC AAAAGAAAATGT TTTATATGAAGAGT TT TTAAAATT 
GT C GTGT C GT TT AGCGGAAAGT TTT AAAAAGTATGGAT TAAAAC AAAAC GAC AC AAT AGC 
GGTGTGTAGC GAAAATGGTT TGCAAT TT TT C C TT CC TATAATTGC ATC ATTGTATC TT GG 
AATAATTGCAGCACCTGTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTTGG 
T ATT GT AAAAC C AC GC ATAATTTTTT GCTC CAAGAATACTTTTCAAAAAGTACTGAATGT 
AAAATC TAAATTAAAATATGTAGAAACT ATTATTATATTAGACTT AAATGAAGACT TAGG 
AGGT TAT C AATGC C TCAACAACTT TATTTC T CAAAATTC C GATAT TAATC TTGAC GTAAA 
AAAATT TAAACCATATTC TTTT AATC GAGAC GAT CAGGTTGC GT T GGT AATGTTTTCTTC 
TGGTAC AACTGGTGTTTC GAAGGGAGTC ATGCTAAC TCACAAGAATATTGTTGTACGATT 
TT CTC TTGCAAAAGATCC TACT TT TGGTAACGC AAT TAATC CAAC GAC AGCAATTTTAAC 
GGTAAT AC C TTTC CAC CATGGT TT TGGT AT GATGAC C ACATTAGGATACT TT ACTTGTGG 
ATTC C GAGTT GTTCTAATGC AC ACGT TT GAAGAAAAACTATTTC T ACAATCATTACAAGA 
TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTCTTGCAAAAAGTGC 
ATTAGTTGAAAAGTAC GATTTATCGC AC TTAAAAGAAATT GCATC TGGTGGCGCAC CTTT 
ATC AAAAGAAATTGGGGAGAT GGTGAAAAAAC GGTT TAAATTAAAC TT TGTC AGGC AAGG 
GTATGGATTAAC AGAAAC C AC TTC GGCT GT TT TAAT TACAC C GAACAATGAC GTCAGACC 
GGGATCAAC TGGT AAAAT AGT AC CAT TT CACG C T GTTAAAGT TGT C GATC C TACAACAGG 
AAAAATTTTGGGGC CAAATGAAACT GGAGAATTGTATTTTAAAGGCGACATGATAATGAA 
AGGT TATTATAATAATGAAGAAGC TAC T AAAGCAAT TATT AC C AAAGAC GGATGGT TGCG 
C T CTGGTGATATTGCTTATTATGACAAT GATGGC CATTTT TATAT TGT GGAC AGGC TGAA 
GT CATTAATTAAATATAAAGGTTATC AGGTTGCACC TGCTGAAATTGAGGGAAT ACTCTT 
AC AAC ATC C GTATATTGTTGATGC C GGC GT TACT GGTATAC C GGATGAAGCC GC GGGC GA 
GC TTCCAGCTGCAGGTGTTGTAGTAC AGAC TGGAAAATATCTAAACGAAC AAATCGTACA 
AAAT TT TGTT TC CAGT C AAGTTTC AACAGC CAAATGGCTAC GTGGTGGGGTGAAAT TT TT 
GGAT GAAATT C C CAAAGGAT CAAC T GGAAAAATT GACAGAAAAGT GTTAAGAC AAATGTT 




T GAAAAAC AC AC CAATGGG *



GGATCCAATGGAAGATAAAAATATTTTATATGGACCTGAACCATTTTATCCCTTGGCTGA
TGGGAC GGCT GGAGAACAGATGTT TT AC GC AT TATC TC GT TATGCAGATATT TC AGGATG 
C ATAGC AT TGAC AAAT GC TC AT AC AAAAGAAAAT GT TTTATATGAAGAGT TT TT AAAATT 
GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 
GGTGTGTAGC GAAAATGGTT TGCAAT TTTTCCTTCC TATAATTGC ATC AT TGTATCTTGG 
AAT AAT TGCAGC AC CT GT TAGTGATAAATACATTGAACGTGAAT T AATAC AC AGTC TT GG 
TATT GTAAAACC AC GC AT AATTTT TT GC TCCAAGAATAC TTTT C AAAAAGTACT GAAT GT 
AAAATC TAAATT AAAATATGT AGAAACT ATTATT AT ATT AGAC T T AAATGAAGACT TAGG 
AGGT TATC AATGCC TC AACAAC TT TATT TC T CAAAATTC C GATAT TAATC TTGAC GTAAA 
AAAATTTAAACCATATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTTTCTTC 
TGGTACAACTGGTGTTTCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCACGATT 
TTCTATTGCAAAAGATCC TACTTTTGGTAACGCAATTAATCCAAC GACAGCAATTTTAAC 
G GT AAT AC C T T T CCAC CATGGT TT TGGT AT GATGAC C ACATTAGGATAC TTT AC TT GTGG 
ATT C C GAGTT GTTC TAAT GC AC AC GT TT GAAGAAAAACTATTTC T ACAAT CATT AC AAGA 
T TATAAAGTGGAAAGT AC TTTACTTGTAC C AACATTAATGGC ATTTCTTGCAAAAAGTGC 
AT TAGT TGAAAAGTAC GATTTATC GC AC TTAAAAGAAATTGCATC TGGTGGC GC AC CTTT 
ATCAAAAGAAATTGGGGAGATGGT GAAAAAAC GGTT TAAATT AAACT TTGTC AGGC AAGG 
GT AT GGAT TAAC AGAAAC CAC TTC GGC T GT TT TAAT TACAC C GAACAATGAC GT CAGAC C 
GGGATCAAC T GGT AAAAT AGTACC AT TT CACGCT GT TAAAGTTGT C GATC C TACAACAGG 
AAAAATTT TGGGGCC AAATGAAAC TGGAGAAT TGTATT TTAAAGGC GACATGAT AATGAA 
AGGT TATT AT AAT AAT GAAGAAGC TAC T AAAGCAAT T AT T AACAAAGAC GGAT GGT T GC G 
CTCTGGTGATATTGCTTATTATGACAAT GATGGC CATTTT TATATTGTGGACAGGCTGAA 
GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 
ACAACATC C GTATATTGTTGATGC CGGC GTTACT GGTATACC GGATGAAGC C GC GGGC GA 
G C TTC CAGCTGC AGGT GT TGT AGT AC AGAC TGGAAAAT AT CT AAAC GAAC AAAT CGTACA 
AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 
GGATGAAAT TCC CAAAGGAT CAAC TGGAAAAATT GAC AGAAAAGTGTTAAGAC AAATGTT 
T GAAAAAC AC AC CAATGGG * 



GGATCCAATGGAAGATAAAAATATTTTATATGGACC TGAACCATTTTATCC CTTGGCTGA 
TGGGACGGCTGGAGAACAGATGTT TGAC GC ATTATCTCGTTATGC AGATATTTC AGGATG 
CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 
GT CGTGTC GTTT AGCGGAAAGTTT TAAAAAGT AT GGAT TAAAAC AAAACGAC ACAATAGC 
GGTGTGTAGC GAAAAT GGTT TGCAAT TTTT C C TT CC TATAATTGC ATC ATTGTATC TTGG 
AATAATTGCAGCAC CT GT TAGTGATAAATACATT GAAC GT GAATT AAT AC AC AGTCTTGG 
T ATT GT AAAACC AC GC ATAATTTTTT GCTC CAAGAATACT TTTCAAAAAGTACT GAATGT 
AAAATC TAAAT T AAAATATGTAGAAACTAT TATT AT AT TAGACTT AAATGAAGACT TAGG 
AGGTTATCAATGCC TCAACAAC TT TATTTC TC AAAATTCC GATATTAATC TTGACGTAAA 
AAAATTTAAAC CATAT TCTT TT AATC GAGAC GAT CAGGTT GC GTTGGTAATGT TTT C TTC 
T GGTAC AAC T GGTGTT TC GAAGGGAGTC ATGC TAAC TC AC AAGAATATTGTT GC AC GATT 
TT C TCATGCAAAAGAT CC TACT TT TGGTAACGC AAT TAATCC AACGACAGCAATTT TAAC 
GGTAAT AC C T TTC C AC CATGGT TT TGGT AT GA tGAC CACATT AGGATACTTTAC TTGTGG 
AT TC CGAGTT GTTC TAATGC AC AC GT TTGAAGAAAAACTATT TC TACAATCATTAC AAGA 
TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTTTTGCAAAAAGTGC 
AT T AGT TGAAAAGT AC GATTTATC GC AC TT AAAAGAAATTGC ATC TGGT GGC GCAC CTTT 
ATC AAAAGAAATTGGGGAGATGGT GAAAAAAC GGTT TAAATT AAAC TTT GTC AGGCAAGG 
GT ATGGAT TAAC AGAAACCACT TC GGC T GT TTTAAT TAC AC C GAACAATGAC GT CAGAC C 
GGGAT CAAC T GGTAAAATAGTACC AT TT CACG C T GT TAAAGT TGTC GATC CTAC AACAGG 
AAAAATT T TGGGGC CAAATGAAAC TGGAGAAT TGTATT TT AAAGGC GACAT GAT AATGAA 
AGGT T ATT AT AATAAT GAAGAAGC TAC T AAAGCAAT TATT AACAAAGAC GGATGGT TGC G 
C TCT GGT GATATTGCTTATT ATGACAATGATGGC CATT TTTATATTGTGGAC AGGC TGAA 
GT CATT AATT AAAT AT AAAGGTTATC AGGT TGC ACC TGCT GAAAT T GAGGGAAT AC TCTT 
AC AACATCCGTATATTGTTGATGC CGGC GTTACT GGTATAC C GGATGAAGC C GCGGGCGA 
GC TT CCAGC TGC AGGT GTT GT AGT AC AGAC TGGAAAAT AT CT AAAC GAAC AAAT CGTACA 
AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 
GGATGAAATTCCCAAAGGATCAAC TGGAAAAATT GACAGAAAAGTGTTAAGACAAATGTT 
T GAAAAAC AC ACCAATGGG * 
GGATCCAATGGCAGATAAAAATAT TT TATATGGGCC CGAACC AT T TTATC C C TTGGCT GA
TGGGAC GGCTGGAGAACAGATGTTTGACGCATTATCTC GTTATGC AGATATTTCAGGATG 
CATAGC ATTGAC AAAT GC TC AT AC AAAAGAAAATGT TT TATATGAAGAGTTT TTAAAATT 
GTCGTGTC GTTT AGC GGAAAGTTT TAAAAAGT ATGGAT TAAAAC AAAAC GAC AC AATAGC 
GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCGTAATTGCATCATTGTATCTTGGA 
AT AATT GCAGC ACC TGTTAGTGAT AAATAC ATTGAACGTGAATT AATACACAGT CTTGGT 
ATTGTAAAACCACGCATAATTTTTTGCTCCAAGAAT AC TTTTCAAAAAGTAC TGAATGTA 
AAATCT AAATTAAAATCTGTAGAAAC TATT AT TATATT AGAC TT AAAT GAAGAC TT AGGA 
GGTTAT CAATGC CTCAAC AAC TTTATTTCT CAAAAT TC C GAT ATT AATCT T GAC GT AAAA 
AAATTT AAAC CATATTCTTT TAAT C GAGAC GATC AGGTTGC GTT GGTAAT GT TT TC TT CT 
GGTACAACTGGTGTTTCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCACGATTT 
TC TC TTGCAAAAGATC C TAC TTTT GGTAAC GC AATT AAT C CCAC GACAGC AATT TTAACG 
GTAATACCTTTCCACCATGGTTTTGGTATGAt gACC AC ATTAGGATAC TTTACTTGTGGA 
TTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGAT 
T ATAAAGT GGAAAGTAC TTTAC TT GT ACCAAC ATTAAT GGC ATT T CTTGC AAAAAGTGCA 
TTAGTT GAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATC TGGT GGCGCACC TTTA 
T C AAAAGAAATT GGGGAGAT GGT GAAAAAAC GGT T T AAAT T AAAC TT T GT CAGG CAAGGG 
TATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAAAxxxxxxGCCAGACCG 
GGATCAAC TGGTAAAATAGTACCATTTCAC GCTGTTAAAGTTGTC GATCC TACAAC AGGA 
AAAATT TTGGGGCC AAATGAACCTGGAGAATTGTATTTTAAAGGC GC CATGATAATGAAG 
GGTTAT TATAATAATGAAGAAGC TAC T AAAGC AAT T AT TGATAATGAC GGAT GGTT GC GC 
TCTGGTGATATT GC TTATTATGACAATGAT GGCCATTTTTATATTGTGGACAGGCTGAAG 
TC ATTAAT TAAATAT AAAGGTT AT CAGGTT GC ACC TGC TGAAATT GAGGGAATACT CTT A 
C AAC AT CC GT ATAT TGTT GATGC C GGC GT T AC TGGT AT TCC GGATGAAGC C GCGGGC GAG 
CTTCCAGC TGCAGGTGTTGTAGTACAGACT GGAAAATATCTAAAC GAACAAATC GTACAA 
GATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTTG 
GATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTTT
GAAAAACACACCAATGGG* 
GGATC CAATGGCAGAT AAAAAT AT TTTATATGGGCCCGAAC CATT TTATC C C TT GGCT GA
TGGGAC GGC TGGAGAACAGATGTT TT AC GCATTATC TC GTTATGC AGATATTT C AGGATG 
C ATAGC ATTGAC AAATGC TC AT AC AAAAGAAAAT GTTTTATATGAAGAGT TT TT AAAAT T 
GTC GTGTC GTTT AGC GGAAAGTTT TAAAAAGTAT GGATTAAAAC AAAAC GAC AC AAT AGC 
GGTGTGTAGCGAAAATGGTT TGCAATTT TTCCTTCC TGTAATTGC ATC ATTGTATC TTGG 
AAT AAT TGCAGC AC CTGT TAGT GATAAATACATT GAAC GT GAATT AAT AC AC AGTC TT GG 
T ATTGT AAAACC AC GC AT AATTTT TT GC TC CAAGAATACTTTTC AAAAAGTACTGAATGT 
AAAATC TAAATT AAAATATGTAGAAAC TATTATTATATTAGACTTAAATGAAGAC TTAGG 
AGGTTATC AATGCC TC AACAAC TTT ATT TC TCAAAATTC C GATAT TAATC TT GAC GTAAA 
AAAATTTAAAC CATAT TC TT TTAATC GAGAC GAT CAGGTT GC GTT GGTAATGTTTT CTTC 
T GGT ACAACTGGTGTTCC GAAGGGAGTC AT GC TAAC TCACAAGAATAT TGTT GC AC GATT 
TTCTCTTGCAAAAGATCC TACTTT TGGTAACGCAATTAATCCAAC GACAGCAATTTTAAC 
GGTAATAC CTTT CC ACCATGGT TT TGGT AT GATGAC C ACATTAGGATACT TTAC TTGTGG 
ATTCCGAGTTGTTC TAATGC AC AC GTTTGAAGAAAAACTATTTCTACAATCATTACAAGA 
TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTCTTGCAAAAAGTGC 
AT TAGTTGAAAAGTACGATTTATCGC AC TTAAAAGAAATTGCATCTGGTGGC GCACCTTT 
ATC AAAAGAAAT TGGGGAGATGGT GAAAAAAC GGTT TAAATTAAACTTTGTC AGGC AAGG 
GT ATGGAT TAAC AGAAAC CACT TC GGCT GT TT TAAT TACACC GAAAxxxxxxGTCAGAC C 
GGGATC AAC TGGTAAAAT AGTAC C AT TT CAC GC T GT TAAAGT TGTC GAT C CT AC AAC AGG 
AAAAAT TTT GGGGC CAAATGAACC TGGAGAATTGTATT TTAAAGGC GACATGATAATGAA 
AGGTTATTATAATAATGAAGAAGC TAC TAAAGCAATT ATTGATAAAGAC GGATGGTTGC G 
C T CTGGTGATAT TGC TTATT AT GAC AAT GATGGC CATT TTTATATTGTGGAC AGGC TGAA 
GT C ATT AATTAAATATAAAGGT TAT C AGGT T GCACC TGCT GAAATTGAGGGAAT AC TC TT 
AC AACATC C GTATATTGTTGAT GC C GGC GT TAC T GGT AT AC C GGATGAAGCC GC GGGC GA 
GC TT C C AGC TGC AGGTGT TGTAGTAC AGAC TGGAAAATATC T AAACGAAC AAAT CGTACA 
AAAT TT TGTTTC CAGTCAAGTTTCAACAGC CAAATGGC TACGGGGTGGGGTGAAAT TTTT 
GGAT GAAAT T C C CAAAGGAT CAAC T GGAAAAATT GACAGAAAAGT GTT AAGAC AAAT GTT 
T GAAAAAC ACACCAATGGG * 
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GGATCCAATGGCAGAT AAAAAT AT TT TATATGGGCC CGAACC AT T TTATC CC TT GGCT GA 
TGGGACGGCTGGAGAACAGATGTTTGAC GCATTATCTC GTTATGCAGATATTCC CGGATG 
C ATAGC AT TGAC AAAT GC T C ATAC AAAAGAAAATGTTT TATATGAAGAGT TT TT AAAATT 
GT CGTGTC GT TT AGC GGAAAGTTTTAAAAAGT ATGGATTAAAAC AAAACGAC AC AATAGC 
GGTGTGTAGC GAAAAT GGTTTGCAATAT TTCC TTCC TGTAAT TGC ATC AT TGTATC TT GG 
AATAATTGCAGCACCT GTTAGTGATAAATACATT GAAC GTGAAT T AATAC AC AGTC TT GG 
TATTGTAAAACCACGCATAATTTTTTGCTC CAAGAATACTTT TC AAAAAGTACTGAATGT 
AAAATC TAAATTAAAATATGTAGAAAC TAT TATT AT AT TAGACT T AAATGAAGACT TAGG 
AGGT TATC AATGCCTCAACAAC TT TATTTC TC AAAATTCC GATAT TAATC TT GACGTAAA 
AAAATT TAAACCAAAT TC T TTT AATC GAGACGAT CAGGTT GC GT T GGT AATGTT TT CT TC 
T GGTAC AACT GGTGTTCC GAAGGGAGTC ATGC TAAC TC AC AAGAATAT TGTT GC AC GATT 
T T CTATTGCAAAAGAT CC TAC TTT TGGTAACGCAAT TAATC C AAC GAC AGCAAT TT TAAC 
GGTAATAC CTTTCCACCATGGTTTTGGTATGATGAC CACATTAGGATACTTTACTTGT GG 
AT TC C GAGTT GTTC TAAT GC AC AC GTTTGAAGAAAAACTATT TC T ACAAT CATT AC AAGA 
TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTCTTGCAAAAAGTGC
ATTAGTTGAAAAGTAC GATTTATC GC AC TT AAAAGAAATTGC AT C TGGTGGC GC AC CT TT 
AT CAAAAGAAAT TGGGGAGATGGTGAAAAAAC GGTT TAAATTAAAC TTT GTC AGGC AAGG 
GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAAAxxxxxxGCCAGACC 
GGGAT C AACT GGTAAAAT AGTAC CAT T TCAC GC T GT TAAAGT TGTCGAT C CT AC AACAGG 
AAAAAT TT T GGGGC CAAAT GAAC C T GGAGAAT T GTATT TT AAAGGC GC CAT GAT AATGAA 
GGGTTATTATAATAAT GAAGAAGC TAC TAAAGCAATTATTGATAAAGAC GGATGGTTGC G 
C TCTGGTGATATTGC TTATTATGACAATGATGGCCATTTTTATATTGTGGAC AGGC TGAA 
GT CATTAATTAAATATAAAGGTTATC AGGTTGCACC TGCTGAAATTGAGGGAATAC TC TT 
AC AACATC C GT ATAT T GT TGAT GC C GGC GTTAC T GGTATAC C GGATGAAGC C GC GGGC GA 
GC TT C C AGCT GCAGGT GT TGTAGT AC AGAC TGGAAAAT ATCTAAACGAAC AAAT CGTACA 
AAATTTTGTT TC CAGTCAAGTTTCAACAGC CAAATGGC TACGTGGTGGGGTGAAATTTTT 
GGAT GAAATT C C CAAAGGAT CAAC T G GAAAAAT T GACAGAAAAGT GT T AAGACAAATGT T 
T GAAAAAC ACACCAATGGG * 
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GGAT CC AATGGCAGATAAAAATATTTTATATGGGCC CGAACC ATTTTATC C C TT GGCT GA 
TGGGACGGCTGGAGAACAGATGTTTGACGCATTATCTCGTTATGCAGATATTCCCGGATG
C ATAGCAT TGACAAATGCTCAT AC AAAAGAAAAT GTTTTATATGAAGAGT TT TT AAAATT 
GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 
GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCTGTAATTGCATCATTGTATCTTGG 
AATAAT TGCAGCAC CTGTTAGTGATAAATACGTTGAAC GTGAAT TAATAC AC AGTCTT GG 
TATT GT AAAACC AC GCATAATTTTTT GCTC CAAGAATACTTT TC AAAAAGTACT GAAT GT 
AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 
AGGT TATC AATGCC TC AACAAC TTTATT TC TC AAAATT C C GATAGTAATC TGGAC GTAAA 
AAAATT TAAACC AAATTCTT TTAATC GAGACGATCAGGTTGC GT TGGT AATGTTTTCTTC 
T GGTACAACT GGTGTTCC GAAGGGAGT CAT GC TAAC TCAC AAGAATAT TGTT GC AC GATT 
T TC T CTTGCAAAAGAT CC TACTTTTGGTAACGCAAT TAAT C C AAC GAC AGCAAT TTTAAC 
GGTAATAC CTTTCCAC CATGGTTTTGGTATGATGACCACATTAGGATACTTTAC TTGTGG 
ATTC C GAGTT GTTC T AATGC ACAC GTTTGAAGAAAAAC TATT TC T ACAAT C ATT AC AAGA 
T TAT AAAGTGGAAAGTAC TT TACTTGTAC C AAC ATTAATGGC AT TT C TTGCAAAAAGT GC 
AT TAGTT GAAAAGT AC GATT TATC GCAC TT AAAAGAAATTGC AT C TGGTGGC GC AC CTTT 
AT CAAAAGAAATTGGGGAGATGGTGAAAAAAC GGT TTAAAT TAAACTT TGTC AGGC AAGG 
GT AT GGAT TAAC AGAAAC CACT TC GGC T GT TT TAATTACACC GAAAxxxxxxGCCAGACC 
GGGATCAACT GGTAAAAT AGTACCATTTCACGCTGTTAAAGTTGTCGATC CTAC AACAGG 
AAAAAT TT TGGGGC CAAATGAACCT GGAGAAT TGTATT TT AAAGGC GC CAT GAT AATGAA 
GGGT TATT AT AATAAT GAAGAAGC T ACTAAAGC AATTATTGATAAAGACGGATGGTTG CG 
C T CT GGTGAT ATTGCT T ATTAT GACAAT GATGGC CATT TTTATATTGT GGAC AGGC T GAA 
GTCATT AATT AAAT ATAAAGGT TATC AGGT TGCAC C TGC TGAAATTGAGGGAAT AC TC TT 
ACAACATC CGTATATTGTTGATGCCGGC GTTACTGGTATACCGGATGAAGCC GCGGGC GA 
G C TT CC AGCT GC AGGT GTTGTAGT AC AGAC TGGAAAATATC TAAACGAAC AAAT C GTACA 
AAAT TT TGTTTCCAGT CAAGTTTCAACAGC CAAATGGCTACGTGGTGGGGTGAAATTTTT 
GGAT GAAATT CC CAAAGGAT CAAC TGGAAAAAT TGACAGAAAAGT GTTAAGACAAAT GTT 
T GAAAAAC AC ACCAATGGG * 
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GGAT CC AATGGCAGAT AAAAAT AT TT TATATGGGCCCGAACC AT TTTATC CC TT GGCT GA 
T GGGAC GGC T GGAGAACAGATGTT TGAC GCATTATC TC GTTATGC AGATATT CC GGGC TG 
C ATAGC ATTGAC AAAT GCTCATAC AAAAGAAAATGTTTTATATGAAGAGT TT TT AAAATT 
GTCGTGTC GTTTAGCGGAAAGT TTTAAAAAGTATGGATTAAAACAAAACGACAC AATAGC 
GGTGTGTAGC GAAAAT GGTT TGCAAT TTTTCCTTCC TGTAATTGC ATC ATTGTATC TT GG 
AAT AAT TG TGGC AC CT GT TAAC GAT AAAT ACAT T GAAC GT GAAT T AAT AC AC AGT C T T GG 
TATT GT AAAACCAC GC AT AGTT TT TT GCTC C AAGAATACTTT TC AAAAAGTACTGAAT GT 
AAAATCTAAATTAAAATCTGTAGAAACT AT TATTATATTAGACT TAAAT GAAGAC TTAGG 
AGGTTATCAATGCC TCAACAAC TT TATTTC TC AAAATTC C GATAT TAATC T T GAC GTAAA 
AAAATTTAAACC ATAT TCTT TT AATC GAGACGATCAGGTTGC GTT GATTATGTT TTC T TC 
TGGTACAACTGGTCTGCC GAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCAC GATT 
T T C TC TTGCAAAAGAT CCTACT TTT GGTAACGC AATTAATC CCAC GAC AGCAAT T TTAAC 
GGTAATACCT TTCC AC CATGGTTTTGGT ATGATGACCACATTAGGATACT TT AC T T GTGG 
AT TC C GAGTT GTTC TAATGC AC AC GTTTGAAGAAAAAC TATT TC T ACAAT CAT T AC AAGA 
T T ATAAAGTGGAAAGT AC TT TAC TTGTACC AACATTAATGGC ATT TC TTGCAAAAAGTGC 
ATTAGTTGAAAAGTAC GATT TAT C GC AC TTAAAAGAAATTGC ATC TGGTGGC GC ACC TTT 
AT CAAAAGAAATTGGGGAGATG GT GAAAAAAC GGT T TAAATT AAACTT TGT C AGGC AAGG 
GTATGGAT TAACAGAAAC CACT TC GG C T GT TT T AAT TAC AC C GAAAxxxxxxGC CAGAC C 
GGGATC AACT GGTAAAAT AGTACC AT TTC AC GC TGT TAAAGT TGT C GATC C TAC AACAGG 
AAAAATTTTGGGGC CAAATGAACC TGGAGAATTGTATTTTAAAGGC CCGATGATAATGAA 
GGGTTAT TAT AAT AAT GAAGAAGC TAC T AAAGCAAT TATTGATAATGAC GGAT GGT T GC G 
C T C T GGTGAT AT TGCT TATT AT GACAATGATGGC CATTTTTAT AT TGTGGAC AGGC T GAA 
GT CATTAATTAAATATAAAGGT TATC AGGTTGCAC C TGCTGAAAT TGAGGGAAT ACT C TT 
AC AACATC C GTATATT GTTGAT GC C GGC GT TACT GGTAT TC C GGATGAAGC C GC GGGC GA 
GC TT C C AGC T GC AGGT GTTGTAGT AC AGAC TGGAAAAT ATCT AAACGAAC AAATC GTACA 
AGATTTTGTT TCCAGTCAAGTTTCAACAGC CAAATGGCTACGTGGTGGGGTGAAATTTTT 
GGAT GAAATT C C CAAAGGAT CAAC TGGAAAAATTGACAGAAAAGTGTT AAGACAAATGTT 
T GAAAAAC ACACCAATGGG 
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GGAT CCAATGGCAGATAAGAAT ATTT TATATGGGCC CGAACCATT TTATCC C TT GGAAGA 
T GGGACGGC TGGAGAACAGATGTTTGAC GC AT TATC TC GT TATGCAGATATTC C GGGCTG 
CATAGCATTGAC AAATGC TC AT AC AAAAGAAAAT GT TT TATATGAAGAGTTTCTGAAACT 
GT C GTGTCGT TTAGC GGAAAGTTT TAAAAAGTAT GGATTAAAACAAAACGACAC AATAGC 
GGTGTGTAGCGAAAATGGTCTGCAATTTTTCCTTCCTGTAATTGCATCATTGTATCTTGG 
AATAATTGTGGCAC C T GT TAAC GATAAATACATT GAACGT GAATT AATAC AC AGTC TTGG 
T ATT GTAAAACCAC GC ATAATT TT TT GCTC CAAGAATACTTTTCAAAAAGTAC T GAAT GT 
AAAATC TAAATT AAAATCTGTAGAAACTAT TATT AT ATTAGACTT AAATGAAGAC TTAGG 
AGGTTATC AATGCCTCAACAAC TT TATTTC TCAAAATTCC GATATTAATCTTGACGTAAA 
AAAATT TAAACC ATATTC TT TT AATC GAGACGAT CAGGTT GC GTT GTTAATGT T TTC TTC 
TGGT AC AAC TGGTCTGCC GAAGGGAGTC AT GC TAAC TC AC AAGAATAT TGTTGCAC GATT 
TTCTCTTGCAAAAGATCCTACTTTTGGTAACGCAATTAATCCCACGACAGCAATTTTAAC
GGTAATAC CTTT CC AC CATGGT TTTGGT AT GATGAC CACATTAGGATAC TTTACTTGT GG 
AT TC C GAGTTGTTC TAAT GC AC AC GT TTGAAGAAAAAC TATT TC TACAAT CATTAC AAGA 
T TAT AAAGTGGAAAGT AC TT TAC TTGTACC AACATTAATGGC ATT TCTTGCAAAAAGTGC 
AT TAGTTGAAAAGT AC GATTTATC GC AC TT AAAAGAAATT GC ATC TGGTGGC GCAC CTTT 
AT CAAAAGAAATTGGGGAGATGGTGAAAAAAC GGTT TAAATT AAAC TT TGTCAGGCAAGG 
GT AT GGATTAAC AGAAACCACT T C GGC TGTTT TAAT TACACC GAAAxxxxxxGC CAAAC C 
GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 
AAAAATTTTGGGGCCAAATGAACCTGGAGAATTGTATTTTAAAGGCCCGATGATAATGAA 
GGGT TAT T AT AAT AAT GAAGAAGC TAC T AAAGCAAT TATT GATAATGAC GGAT GGT T G C G 
C TCT GGTGATATTGCTTATTATGACAATGATGGC CATTTTTATATTGTGGAC AGGCTGAA 
GT CACTGATTAAAT ATAAAGGT TATC AGGT TGCACC TGCTGAAAT TGAGGGAAT AC TC TT 
AC AACATC C GTAT ATT GT TG AT GC CGGC GT TAC T GGTATTCC GGATGAAGC C GC GGGC GA 
GC TT CC AGC TGCAGGTGT TGTAGT AC AGAC TGGAAAAT AT C TAAACGAAC AAAT C GTACA 
AGATTATGTTGCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 
GGAT GAAAT T C C CAAAGGAT CAAC T GGAAAAAT T GACAGAAAAGT GT T AAGACAAATGT T 
TGAAAAACACACCAATGGG 

GGAT CC AATGGAAGATAAAAATATTTTATATGGAC C TGAACC AT TTTATC CC TT GGCT GATGGGAC GGC TGGAGAACAG 

ATGTTTTACGCATTATCTCGTTATGCAGATATTTCAGGATGCATAGCATTGACAAATGCTCATACAAAAGAAAA 
TATATGAAC^GTTTTTAAAATTGTCGTGTCGTTTAGCGGAAAGTTTT^^ 

AGCGGT GTGTAGCGAAAATGGTTTGCAATTTTTC CT TC CTTTAAT TGC ATCATTGTAT CTTGGAATAATT GC AGCACC T 
GTTAGT GATAAATACATT GAAC GTGAAT TAATAC AC AGTC TT GGTATT GT AAAACCAC GC AT AATT TTTT GTTC CAAGA 
ATACTTTTCAAAAAGTACTGAATGTAAAATCTAAAT TAAAATAT GTAGAAAC TATT ATTATATT AGAC TT AAAT GAAGA 
CTTAGGAGGTTATCAATGCCTCAACAAC TTTATTTC TCAAAATTCCGATATTAATCTTGACGTAAAAAAATT TAAACCA 
AATT CT TT TAAT CGAGAC GATCAGGT TGC GTTGGTAATGTTT TC TTCT GGTACAAC TGGT GTTT CGAAGGGAGT CATGC 
TAAC TC ACAAGAATATTGTTGCACGATTTTCTCATTGC AAAGATCCTACTTT TGGTAACGCAATTAATCCAACGACAGC 
AATTTTAACGGTAATACCTTTC CACCATGGTTTTGGTATGATGACCAC ATTAGGATACTTTACTTGTGGATT CC GAGTT 
GC TC TAATGC AC AC GT TTGAAGAAAAAC TATTTC TACAATCATTACAAGATT ATAAAGTGGAAAGT ACTTTAC TTGTAC 
C AAC AT TAAT GGCATT TT TT GCAAAAAGTGCATTAGTT GAAAAGT AC GATT TAT CGCACTTAAAAGAAAT TGCATC TGG 
T GGC GC AC CT TT AT CAAAAGAAATTGGGGAGATGGT GAAAAAAC GGTT TAAATTAAAC TTTGTC AGGC AAGGGTATGGA 
T TAACAGAAACC AC TT CGGC TGTTTT AATTAC AC C GGACACT GAC GTC AGAC CGGGAT CAAC TGGTAAAATAGTAC CAT 
T TCACGC TGT TAAAGT TGTC GATC CT AC AACAGGAAAAATTTTGGGGC C AAATGAAAC TGGAGAAT TGTATTTT AAAGG 
C GAC AT GATAAT GAAAAGTT AT TATAAT AATGAAGAAGC TAC TAAAGC AATTAT TAAC AAAGAC GGAT GGTT GC GC TC T 
GGTGAT AT TGCT TATT AT GAC AATGATGGCCATTTTTAT ATT GTGGAC AGGC TGAAGT CATTAATT AAAT ATAAAGGTT 
AT CAGGTT GC AC CT GC TGAAAT TG AGGGAATACT C TTACAAC ATC CGT AT AT TGT T GATGC C GGCGTTAC TGGTATACC 
G GATGAAGCC GC GGGC GAGC TT C C AGCTGC AGGT GT TGTAGT AC AGAC T GGAAAAT AT CT AAAC GAAC AAAT C GTACAA 
AATTT T GT TT CC AGTC AAGT TT CAAC AGC C AAAT GGC TAC GTGGT GGGGT GAAATT TT TGGATGAAAT TC C C AAAGGAT 
CAAC TGGAAAAATT GACAGAAAAGTGTTAAGAC AAATGTTTGAAAAAC AC AAATC TAAGC TG 
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GGATCC CAT GATGAAGCGAGAGAAAAATGTTATATAT GGACCCGAACCCCTACACCC CTT 
GGAAGACTTAACAGCTGGAGAAATGCTCTTCCGTGCCCTTCGAAAACATTCTCATTTACC 
GCAGGCTTTAGTAGATGTGGTTGGCGACGAATCGCTTTCCTATAAAGAGTTTTTTGAAGC 
GACAGTCCTCCTAGCGCAAAGTCTCCACAATTGTGGATACAAGATGAATGATGTAGTGTC 
GATCTGCGCCGAGAATAATACAAGATTTTTTATTCCCGTTATTGCAGCTTGGTATATTGG 
TATGATTGTAGCACCTGTTAATGAAAGTTACATCCCAGATGAACTCTGTAAGGTGATGGG 
TATATCGAAACCACAAATAGTTTTTACGACAAAGAACATTTTAAATAAGGTATTGGAGGT
ACAGAGCAGAACTAATTTCATAAAAAGGATCATCATACTTGATACTGTAGAAAACATACA 
CGGTTGTGAAAGTCTTCCCAATTTTATTTCTCGTTATTCGGATGGAAATATTGCCAACTT 
CAAACCTTTACATTTCGATCCTGTTGAGCAAGTGGCAGCTATCTTATGTTCGTCAGGCAC 
TACTGGATTACCGAAAGGTGTAATGCAAACTCACCAAAATATTTGTGTCCGACTTATACA 
TGCTTTAGACCCCAGGGCAGGAACGCAACTTATTCCTGGTGTGACAGTCTTAGTATATCT 
GCCTTTTTTCCATGCTTTTGGGTTCTCTATAACCTTGGGATACTTCATGGTGGGTCTTCG 
TGTTAT GATGTTCAGACGATTT GAT CAAGAAGCATTTCTAAAAGCTATT CAGGATTAT GA 
AGTTCGAAGTGTAATTAACGTTCCATCAGTAATATTGTTCTTATCGAAAAGTCCTTTGGT 
TGACAAATAC GAT TTATCAAGTTTAAGGGAATT GT GT T G CGGTGCG GCACCAT TAGCAAA 
AGAAGTTGCTGAGGTTGCAGCAAAACGATTAAACTTGCCAGGAATTCGCTGTGGATTTGG 
TTT GACAGAATCTAGTT CAGCTAATATACACAGT CTTAGGGAT GAATTTAAATCAGGATC 
ACTTGGAAGAGTTACTCCTTTAATGGCAGCTAAAATAGCAGATAGGGAAACTGGTAAAGC 
ATTGGGACCAAATCAAGTTGGTGAATTATGCATTAAAGGTCCCATGGTATCGAAAGGTTA 
CGT GAACAAT GTAGAAGCT AC CAAAGAAGCTATTGAT GATGAT GGTT GGCTTCACT CT GG 
AGACTTT GGATACTAT GAT GAG GAT GAGCATTT CTATGTGGTGGACCGTTACAAGGAATT 
GATTAAATATAAGGG CT CT CAGGTAGCAC CT GCAGAACTAGAAGAGATTTTATT GAAAAA 
TCCATGTATCAGAGATGTTGCTGTGGTTGGTATTCCTGATCTAGAAGCTGGAGAACTGCC
ATCTGCGTTTGTGGTTAAACAGCCCGGAAAGGAGATTACAGCTAAAGAAGTGTACGATTA
TCTTGCCGAGAGGGTCTCCCATACAAAGTATTTGCGTGGAGGGGTTCGATTCGTTGATAG
CATACCAAGGAATGTTACAGGTAAAATTACAAGAAAGGAACTTCTGAAGCAGTTGCTGGA
GAAGGCGGGAGGT 
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GGAT CC CATGATGAAGCGAGAGAAAAAT GTT ATATAT GGAC C C GAACCCCTAGACCCCTT 

GGAAGACTTAACAGCTGGAGAAATGCTCTTCCGTGCCCTTCGAAAACATTCTCATTTACC 

GCAGGCTTTAGTAGATGTGGTTGGCGACGAATCGCTTTCCTATAAAGAGTTTTTTGAAGC 

GACAGTCCTCCTAGCGCAAAGTCTCCACAATTGTGGATACAAGATGAATGATGTAGTGTC 

GATCTGCGCCGAGAATAATACAAGATTTTTTATTCCCGTTATTGCAGCTTGGTATATTGG 

TAT GATT GTAG CAC CT GTTAAT GAAAGTTACATCCCAGATGAACTCT GTAAGGTGATGGG 

TATATC GAAAC CACAAATAGTTTTT ACGACAAAGAACATTTTAAAT AAGGTATT G GAGGT 

ACAGAGCAGAACTAATTTCATAAAAAGGATCATCATACTTGATACTGTAGAAAACATACA 

CGGTTGTGAAAGTCTTCCCAATTTTATTTCTCGTTATTCGGATGGAAATATTGCCAACTT 

CAAACCTTTACATTTCGATCCTGTTGAGCAAGTGGCAGCTATCTTATGTTCGTCAGGCAC 

TACTGGATTACCGAAAGGTGTAATGCAAACTCACCAAAATATTTGTGTCCGACTTATACA 

TGCTTTAGACCCCAGGGCAGGAACGCAACTTATTCCTGGTGTGACAGTCTTAGTATATCT 

GCCTTTTTTCCATGCTTTTGGGTTCTCTATAACCTTGGGATACTTCATGGTGGGTCTTCG 

TGTTATCATGTTCAGACGATTTGATCAAGAAGCATTTCTAAAAGCTATTCAGGATTATGA 

AGTTCGAAGTGTAATTAACGTTCCATCAGTAATATTGTTCTTATCGAAAAGTCCTTTGGT 

TGACAAATACGATTTATCAAGTTTAAGGGAATTGTGTTGCGGTGCGGCACGATTAGCAAA 

AGAAGTTGCTGAGGTTGCAGCAAAACGATTAAACTTGCCAGGAATTCGCTGTGGATTTGG 

TTT GACAGAATCTACTT CAG CTAATATACACAGTCTTAGGGAT GAATTTAAATCAGGAT C 

ACTT GGAAGAGTTACT CCTTTAAT G GCAGCTAAAATAGCAGATAGGGAAACT GGTAAAGC 

ATTGG GAC CAAATCAAGTTGGT GAATTAT GCATTAAAGGT C CCAT GGTATCGAAAGGTTA 

C GTGAACAAT GTAGAAGCTAC CAAAGAAGCTATT GATGAT GAT GGTTGGCTT CACT CTGG 

AGACTTTGGATACTATGATGAGGATGAGCATTTCTATGTGGTGGACCGTTACAAGGAATT 

GATTAAATATAAGGGCT CTCAGGTAGCAC CT GCAGAACTAGAAGAGATTTTATT GAAAAA 

TCCATGTATCAGAGATGTTGCTGTGGTTGGTATTCCTGATCTAGAAGCTGGAGAACTGCC 

AT CT GC GTTTGT GGTTAAACAGCC CGGAAAGGAGATTACAGCTAAAGAAGT GTACGATT A 

TCTTGCCGAGAGGGTCTCCCATACAAAGTATTTGCGTGGAGGGGTTCGATTCGTTGATAG 

CATACCAAGGAATGTTACAGGTAAAATTACAAGAAAGGAACTTCTGAAGCAGTTGCTGGA 

GAAGGCGGGAGGT 
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Wood and Hall 

17. A DNA molecule having a nucleotide sequence that encodes a
luciferase of claim 1 or 7. 

18. The use of luciferases of claims 1 or 7 in ATP assays; as
luminescent labels for nucleic acids, proteins, or other macromolecules; as genetic 
reporters; in enzyme immobilization; as hybrid proteins; in high temperature 
reactors; and in luminescent solution. 

19. A kit comprising a beetle luciferase with a half-life of at least 2 
hours at 50°C. 

20. The kit of claim 19 used for ATP assays; as luminescent labels for 
nucleic acids, proteins, or other macromolecules; as genetic reporters; in enzyme 
immobilization; as hybrid proteins; in high temperature reactors; and in 
luminescent solution. 

21. A luciferase having an amino acid sequence consisting of
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DPMEDKNILYGPEPFYPLADGTAGEQMFYALSRYADISGCIALTNAHTKENVLYEEFLKL 
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPIIASLYLGIIAAPVSDKYIERELIHSLG
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK

KFKPYSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVARFSLAKDPTFGNAINPTTAILT 
VIPFHHGFGMMTTLGYFTCGFR\AnjMHTFEEKLFMSLQDYKVESTLLVPTLMAFLAKSA 
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPNNDVRP 
GSTGKI VPFHAVKWDPTTGKI LGPNETGELYFKGDMIMKGYYNNEEATKAI INKDGWLR 
SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGE 
LPAAGVWnTGKYLNEQI VQNFVS SQVSTAKWLRGGVKFLDEI PKGSTGKI DRKVLRQMF 
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DPMEDKNILYGPEPFYPLADGTAGEQMFYALSRYADISGCIALTNAHTKENVLYEELLKL 
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPIIASLYLGIIAAPVSDKYIERELIHSLG 
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK
KFKPYSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVARFSHAKDPTFGNAINPTTAILT
VIPFHHGFGMMTTLGYFTCGFRVVLMHTFEEKLFLQSLQDYKN^ 

LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGY'GLTETTSAVLITPNNDVRP 
GSTGKI VP FHAVKWD PTTGKI LGPNETGELYFKGDMIMKGY YNNEEATKAI INKDGWLR 
SGDI AYYDNDGHFY IVDRLKSLI KYKGYQVAPAE I EGI LLQHPYI VDAGVTG I P DEAAGE 
L P AAGWVQT GKYLNE Q I VQN FVS SQVSTAKWLRGGVKFLDE I PKGSTGKI DRKVLRQMF 
EKHTNG 
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DPMEDKNILYGPEPFYPLADGTAGEQMFYALSRYADISGCIALTNAHTKENVLYEEFLKL 
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPIIASLYLGIIAAPVSDKYIERELIHSLG 
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK
KFKPYSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVVRFSIAKDPTFGNAINPTTAILT
VIPFHHGFQ^MTTLGYFTCGFRVVLMHTFEEKLFLQSLQDYKVESTLLVPTLMAFF 
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPNNDVRP
GSTGKIVPFHAVKVVDPTTGKILGPNETGELYFKGDMIMKGYYNNEEATKAIITKDGWLR
SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGE
LPAAGVVVQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMF
EKHTNG 

DPMEDKNILYGPEPFYPLADGTAGEQMFYALSRYADISGCIALTNAHTKENVLYEEFLKL 
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPIIASLYLGIIAAPVSDKYIERELIHSLG 
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDI^EDLGGYQCIJWFISQNSDINLDVK 
KF KP YS FNRDDQVALVMFS S GTTGVS KGVMLTHKN I VARF5 IAKD PT FGNAI N P TTAI LT 
VIP FHHGFGMMTTLG Y FT CG FRWLMHT FEEKLFLQ SLQD YKVE S TLLVP TLMAFLAKS A 
LVEKYDLSHLKE I ASGGAPLSKEI GEMVKKRFKLNFVRQGYGLTETTS AVLI TPNNDVRP 
GSTGKI VPFHAVKVAmPTTGKI LGPNETGELYFKGDMIMKGY YNNEEATKAI INKDGWLR 
S GD I AY YDNDGH FY I VD RLKS L I KYKG YQVAP AE I E GI LLQH P Y I VDAGVTG I P DEAAGE 
LPAAGWVQTGKYLNEQI VQNFVS SQVSTAKWLRGGVKFLDE I PKGSTGKI DRKVLRQMF 
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D PMEDKNI LYGP EP FYPLADGT AGEQMFDALS R YAD I S GC I ALTNAHT KE NVL YEE FLKL 

SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPIIASLYLGIIAAPVSDKYIERELIHSLG 

IVKPRI IFCSKNTFQKVLNVKSKLKYVETI I ILDLNEDLGGYQCLNNFISQNSDINLDVK 

KFKPYSFWRDDQYALVMFSSGTTGVSKGVMLTHKNIVARFSHAKDPTFGNAINPTTAILT 

VIPFHHGFC^1^T^TLGYFTCGFRV^^LMHTFEEKLF1^SLQDYK^^STLLVPTLMAFFAKSA 

LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPNNDVRP 

GSTGKIVPFHAVKWDPTTGKILGPNETGELYFKGDMIMKGYYNNEEATK7VI INKDGWLR 

SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGE 

LPAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMF 
EKHTNG 
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DPMADKNILYGPEPFYPLADGTAGEQMFDALSRYADISGCIALTNAHTKENVLYEEFLKL 
SCRIAESFKKYGLKQNDTIAVCSENGLQFFLPVIASLYLGIIAAPVSDKYIERELIHSLG 
I VKPRI IFCS KNTFQKVLNVKSKLKSVETI 1 1 LDLNEDLGGYQCLNNFISQNSDINLDVK 
KFKPYS FNRDIX}VALVMFSSGTTGVSKGVMLTHKNIVARFSIAKDPTFGNAINPTTAI LT 
VIPFHHGFGMMTTLGYFTCGFRVVI*MOTFEEKLFLQSLQDYKVES 

LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPKX3CARPG 
STGKIVPFHAVK\AO)PTTGKILGPNEPGELYFKGAMIMKGYYNNEEATKAIIDNDGWLRS 
GDI AYYDNDGHFYI VDRLKSLI KYKGYQVAPAEI EGILLQHP YI VDAGVTGI PDEAAGEL 
PAAGVWQTGKYLNEQIVQDFVSSQVSTAKWLRGGVKFLDEI PKGSTGKI DRKVLRQMFE 
KHTNG$ 

DPMADKNI LYGPEPFYPLADGTAGEQMF YALS RYAD I SGC I ALTNAHTKENVLYEEFLKL 
S C RLAE S FKKYGLKQNDT I AVC SE NGLQ FFL P VI AS L YLG 1 I AAP VS DKYIERELIHS LG 
IVKPRIIFCSKNTFQKVIiNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK 
KFKPYSFNRDDQVALVMFSSGTTGVPKGVMLTHKNIVARFSLAKDPTFGNAINPTTAILT 
V I P FHHGFGMMTTLG Y FTC GFRWLMHT FE EKLFLQ S LQD YKVE S TLLVP T LMAFXAKS A 

LVEKYDLSHLKEIASGGAPLSKEIGElWKKRFKLNFVRQGYGLTETTSAVLITPKxxVRPG 
STGKIVPFHAVK\AmPTTGKILGPNEPGELYFKGDMIMKGYYNNEEATKAIIDKDGWLRS 
GDI AYYDNDGHFYI VDRLKSLI KYKG YQVAPAE I EGI LLQHP YIVDAGVTGI PDEAAGEL 

PAAGVVVQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMFE 



DPMADKNILYGPEPFYPLJU3GTAGEQMFDALSRYADIPGCIALTNAHTKENVLYEEFLKL 
SCRLAESFKKYGLKQNDTIAVCSENGLQYFLPVIASLYLGIIAAPVSDKYIERELrHSLG 
IVKPRIIFCSKNTFQKVLWKSKLKYVETIIILDI^EDIX^GYQCI^FISQNSDINLDVK 
KFKPNS FN RD DQVALVMF S SGTTGVPKGVMLTHKNI VARFS I AKDPTFGNAI NPTTAI LT 
VIPFHHGFGMMTTLGYFTCGFRVVLMOTFEEKLFLQSLQ 

LVEKYDLSHLKEIASGGAPLSKEIGEWKKRFKLNFNmQGYGLTETTSAVLITPKxxARPG 
STGKIVPFHAVKVTOPTTGKILGP^PGELYFKGAMIMKGYYNNEEATKAIIDKIXJWLRS 
GDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGEL 
PAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEI PKGSTGKI DRKVLRQMFE 
KHTNG 



DPMADKNI LYGPEPFYPLADGTAGEQMFDALSRYADIPGC IALTNAHTKENVLYEEFLKL 
S C RLAE S FKKYGLKQNDT I AVC S E NGLQ FFLP VI AS L YLG 1 I AAP VS D KYVEREL I HS LG 
IVXPRIIFCSKNTFQKVLNVKSKLKYVETIIILDIJ^EDLGGYQCLNNFISQNSDS^DVK 
KFKPNS FNRDDQVALVMFSS GTTGVPKGVMLTHKNI VARFSIAKDPTFGNAI NPTTAI LT 
VIPFHHGFGMMTTLGYFTCGFRVVLMHTFEEKLFLQSLQDYKVES 

LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPKxxARPG 

STGKIVPFHAVK\An)PTTGKILGPNEPGELYFKGAMIMKGYYNNEEATKAIIDKDGWLRS 

GDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGEL 

PAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKI DRKVXRQMFE 
KHTNG 
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) JS? ^ EKNVIYGPEPLHPLEDLTAGE MLFRALRKHSHLPQALVDWGDESLSYKEFPFA 

iskp Qivfttknilnk\o,evqsrtnfikriiild^ 
^"^ QV ^ ILCSSGTTGL ^ 

fro^o fi RELCCG ^ P ^ EVA ^ AAKRLN LPGIRCGFGLTESTSANIHSLRDEFKSGS 
LGR^PI^KIADRETGKALGPNQVGELCIKGPWSKGYW^TOFA^GwSs? 

dfgyydedehfywdrykelikykgsqvapaeleeillknpcxS^pS^ge^ 
^fwkqpgkeitakewdyi^ervshtkylrggvrfvdsiprnvtSkS 
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22. The luciferase of claim 21 further characterized as having a half-life 
of 2 hours at 50°C. 
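
The half-life recited in the claims can be illustrated with a short calculation. The following is a minimal sketch only: it assumes that loss of luminescence at a fixed temperature follows first-order kinetics, and both the function name `estimate_half_life` and the readings are hypothetical, not measurements from this application.

```python
import math


def estimate_half_life(times_h, activities):
    """Estimate a half-life in hours, assuming first-order decay.

    Fits ln(activity) = ln(A0) - k * t by ordinary least squares
    and returns ln(2) / k.
    """
    if len(times_h) != len(activities) or len(times_h) < 2:
        raise ValueError("need at least two (time, activity) pairs")
    logs = [math.log(a) for a in activities]
    n = len(times_h)
    mean_t = sum(times_h) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_h, logs))
    k = -sxy / sxx  # decay rate constant, per hour
    if k <= 0:
        raise ValueError("activity does not decay over the measured interval")
    return math.log(2) / k


# Hypothetical, noise-free readings generated from a 2-hour half-life.
times_h = [0.0, 0.5, 1.0, 2.0, 4.0]
activities = [100.0 * 0.5 ** (t / 2.0) for t in times_h]
half_life_h = estimate_half_life(times_h, activities)
print(f"estimated half-life: {half_life_h:.2f} h")
```

With real data the readings would be relative luminescence values taken after incubation at 50°C for the listed times; the fit on log-transformed values is one common choice, not the only one.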
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FIGURE 16 
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FIGURE 18A 
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FIGURE 18B 
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FIGURE 18C 
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FIGURE 19 
          341-350    361-370    371-380    381-390    391-400
Lcr       VRQGYGLTET DDKPGASGKV VPLFKAKVID LDTKKSLGPN RRGEVCVKGP
Lla       VRQGYGLTET DDKPGASGKV VPLFKAKVID LDTKKTLGPN RRGEVCVKGP
Lmi       VRQGYGLTET DDKPGASGKV VPLFKVKVID LDTKKTLGVN RRGEICVKGP
Pmi       IRQGYGLTET DDKPGACGKV VPFFTAKIVD LDTGKTLGVN QRGELCVKGP
Ppy       IRQGYGLTET DDKPGAVGKV VPFFEAKVVD LDTGKTLGVN QRGELCVRGP
Lno       IRQGYGLTET DDKPGACGKV VPFFSAKIVD LDTGKTLGVN QRGELCVKGP
Ppe1      IRQGYGLTET EFKLGAVGKV VPFYSLKVLD LNTGKKLGPN ERGEICFKGP
Phg       IIQGYGLTET KIKTGSTGQV LPYVTAKIVD TKTGKNLGPN QTGELCFKSD
GR        IRCGFGLTES EFKSGSLGRV TPLMAAKIAD RETGKALGPN QVGELCIKGP
YG        IRCGFGLTES EFKSGSLGRV TPLMAAKIAD RETGKALGPN QVGELCVKGP
Ppe2      VRQGYGLTET DVRPGSTGKI VPFHAVKVVD PTTGKILGPN ETGELYFKGD
Cons      ---G-GLTE- ----G--G-- -P----K--D --T-K-LG-N --GE------

Column 351-360 is missing from the source. Mutant-row entries recovered for this block (row and column positions lost): XX, XX; A, A; A, P. Unplaced Cons fragment: --A.

          401                                                450
Lcr       MLMKGYVNNP EATKELIDEE GWLHTGDIGY YDEEKHFFIV DRLKSLIKYK
Lla       MLMKGYVDNP EATREIIDEE GWLHTGDIGY YDEEKHFFIV DRLKSLIKYK
Lmi       SLMLGYSNNP EATRETIDEE GWLHTGDIGY YDEDEHFFIV DRLKSLIKYK
Pmi       MIMKGYVNNP EATNALIDKD GWLHSGDIAY YDKDGHFFIV DRLKSLIKYK
Ppy       MIMSGYVNNP EATNALIDKD GWLHSGDIAY WDEDEHFFIV DRLKSLIKYK
Lno       MIMKGYVNNP EATSALIDKD GWLHSGDIAY YDKDGHFFIV DRLKSLIKYK
Ppe1      MIMKGYINNP EATRELIDEE GWIHSGDIGY FDEDGHVYIV DRLKSLIKYK
Phg       IIMKGYYQNE EETRLVIDKD GWLHSGDIGY YDTDGNFHIV DRLKELIKYK
GR        MVSKGYVNNV EATKEAIDDD GWLHSGDFGY YDEDEHFYVV DRYKELIKYK
YG        MVSKGYVNNV EATKEAIDDD GWLHSGDFGY YDEDEHFYVV DRYKELIKYK
Ppe2      MIMKSYYNNE EATKAIINKD GWLRSGDIAY YDNDGHFYIV DRLKSLIKYK

Mutant-row entries for this block (column positions lost): 49-7C6: G; 78-0B10: DN; 90-1B5: DN.

          451                                                500
Lcr       GYQVPPAELE SVLLQHPSIF DAGVAGVPDP VAGELPGAVV VLESGKNMTE
Lla       GYQVPPAELE SVLLQHPNIF DAGVAGVPDP IAGELPGAVV VLEKGKSMTE
Lmi       GYQVPPAELE SVLLQHPNIF DAGVAGVPDP DAGELPGAVV VMEKGKTMTE
Pmi       GYQVPPAELE SILLQHPFIF DAGVAGIPDP DAGELPAAVV VLEEGKMMTE
Ppy       GYQVAPAELE SILLQHPNIF DAGVAGLPDD DAGELPAAVV VLEHGKTMTE
Lno       GYQVPPAELE SILLQHPFIF DAGVAGIPDP DAGELPAAVV VLEEGKTMTE
Ppe1      GYQVPPAELE ALLLQHPFIE DAGVAGVPDE VAGDLPGAVV VLKEGKSITE
Phg       AYQVAPAELE ALLLQHPYIA DAGVTGIPDE EAGELPAACV VLEPGKTMTE
GR        GSQVAPAELE EILLKNPCIR DVAVVGIPDL EAGELPSAFV VIQPGKEITA
YG        GSQVAPAELE EILLKNPCIR DVAVVGIPDL EAGELPSAFV VKQPGKEITA
Ppe2      GYQVAPAEIE GILLQHPYIV DAGVTGIPDE AAGELPAAGV VVQTGKYLNE
Cons      --QV-PAE-E --LL--P-I- D--V-G-PD- -AG-LP-A-V V---GK----

Mutant rows 49-7C6, 78-0B10 and 90-1B5 have no entries in this block.

          501                                                550
Lcr       KEVMDYVASQ VSNAKRLRGG VRFVDEVPKG LTGKIDGRA. IREILKKPV.
Lla       KEVMDYVASQ VSNAKRLRGG VRFVDEVPKG LTGKIDGKA. IREILKKPV.
Lmi       KEIVDYVNSQ VVNHKRLRGG VRFVDEVPKG LTGKIDAKV. IREILKKPQ.
Pmi       QEVMDYVAGQ VTASKRLRGG VKFVDEVPKG LTGKIDSRK. IREILTMGQK
Ppy       KEIVDYVASQ VTTAKKLRGG VVFVDEVPKG LTGKLDARK. IREILIKAKK
Lno       QEVMDYVAGQ VTASKRLRGG VKFVDEVPKG LTGKIDGRK. IREILMMGKK
Ppe1      KEIQDYVAGQ VTSSKKLRGG VEFVKEVPKG FTGKIDTRK. IKEILIKAQK
Phg       KEVMDYIAER VTPTKRLRGG VLFVNNIPKG ATGKLVRTE. LRRLLTQRA.
GR        KEVYDYLAER VSHTKYLRGG VRFVDSIPRN VTGKITRKEL LKQLLEKS..
YG        KEVYDYLAER VSHTKYLRGG VRFVDSIPRN VTGKITRKEL LKQLLEKS..
Ppe2      QIVQNFVSSQ VSTAKWLRGG VKFLDEIPKG STGKIDRKV. LRQMFEKH..
Cons      ---------- V---K-LRGG V-F----P-- -TGK------ ----------

Mutant-row entries for this block (column positions lost): 49-7C6: none; 78-0B10: D; 90-1B5: DY, A.

          551
Lcr       AKM
Lla       AKM
Lmi       AKM
Pmi       SKL
Ppy       G...GKSKL
Lno       SKL
Ppe1      GKSKSKAKL
Phg       AKL
GR        SKL
YG
Ppe2      .KSKL
49-7C6    TNG*
78-0B10   TNG*
90-1B5    TNG*
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Key: 

Lcr: Luciola cruciata 

Lla: Luciola lateralis 

Lmi: Luciola mingrelica 

Pmi: Pyrocoelia miyako 

Ppy: Photinus pyralis 

Lno: Lampyris noctiluca 

Ppe-1: Photuris pennsylvanica (1) 

Phg: Phengodes sp. 

Gr: Pyrophorus plagiophthalamus (green) 

YG: Pyrophorus plagiophthalamus (yellow green) 

Ppe-2: Photuris pennsylvanica (2) 
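
The Cons row of the alignment can be read as marking the positions at which every listed luciferase carries the same residue. As a hedged illustration, the sketch below recomputes that row for the ten-residue column that ends at position 350, using the eleven natural sequences in the order of the key. The function name `consensus` and the use of "-" for non-conserved positions are assumptions made for this example.

```python
def consensus(rows, gap="-"):
    """Return the shared residue where all rows agree, otherwise the gap symbol."""
    if len({len(r) for r in rows}) != 1:
        raise ValueError("all rows must have the same length")
    return "".join(col[0] if len(set(col)) == 1 else gap for col in zip(*rows))


# The column ending at position 350, as printed in the alignment.
segment = [
    "VRQGYGLTET",  # Lcr
    "VRQGYGLTET",  # Lla
    "VRQGYGLTET",  # Lmi
    "IRQGYGLTET",  # Pmi
    "IRQGYGLTET",  # Ppy
    "IRQGYGLTET",  # Lno
    "IRQGYGLTET",  # Ppe-1
    "IIQGYGLTET",  # Phg
    "IRCGFGLTES",  # Gr
    "IRCGFGLTES",  # YG
    "VRQGYGLTET",  # Ppe-2
]
cons = consensus(segment)
print(cons)  # ---G-GLTE-
```

The result agrees with the surviving fragment of the Cons row for this column, G-GLTE-, once the leading non-conserved positions are written out.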
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FIGURE 20 



Plasmid map labels: tac Promoter, Amp, Apa
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FIGURE 22 



GGATCCAATGGAAGATAAAAATATTTTATATGGACCTGAACCATTTTATCCCTTGGCTGA 

TGGG ACGGCTGG AGAACAGATGT TTT ACGCATT ATCT CGTT ATGCAGAT ATTT CAGG ATG 

CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 

GTCGT G TCGTTT AGCGG AAAGTTTT AAAAAGT ATGG ATT AAAACAAAACGACACAAT AGC 

GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCrATAATTGCATCATTGT 

AATAATTGCAGCACCTGTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTTGG 

TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTTTCAAAAAGTACTGAATGT 

AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 

AGGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTCCGATATTAATCTTGACGTAAA 

AAAATTTAAACCATATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTTTCTTC 

TGGT ACAACTGGT GTTTCGAAGGG AGT CAT GCT AACTCACAAGAAT ATT GTTG CACGATT 

TT CTCTTGCAAAAGATCCT ACTTTTGGTAACGCAATTAATCCAACGACAGCAATTT^ 

GGTAATACCTTTCCACCATGGTTTTGGTATGATGACCACATTAGGATACTTTACTTGTGG 

ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGA

TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGGATTTCTTGCAAAAAGTGC 

ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTT

ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGG 

GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAACAATGACGTCAGACC 

GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 

AAAAATTTTGGGGCCAAATGAAACTGGAGAATTGTATTTTAAAGGCGACATGATAATGAA 

AGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTAACAAAGACGGATGGTTGCG 

CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 

GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 

ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACCGGATGAAGCCGCGGGCGA 

GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 

AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT

GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 

TGAAAAACACACCAATGGG* 
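
The nucleotide sequences in these figures can be related to the amino acid sequences in the claims by translation. The sketch below is illustrative only: it assumes the standard genetic code and assumes that the reading frame starts at the second nucleotide of the listing, which is inferred from the amino acid sequences beginning DPMEDKN. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate complete codons of dna, starting at the given offset."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(offset, len(dna) - 2, 3)
    )


# First line of the nucleotide sequence shown above.
first_line = "GGATCCAATGGAAGATAAAAATATTTTATATGGACCTGAACCATTTTATCCCTTGGCTGA"
peptide = translate(first_line, offset=1)
print(peptide)  # DPMEDKNILYGPEPFYPLA
```

Only unambiguous A, C, G and T characters are handled; lines that still contain extraction damage would raise a `KeyError` and need to be repaired first.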
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FIGURE 23 



GGATCCAATGGAAGATAAAAATATTTTATATGGACCTGAACCATTTTATCCCTTGGCTGA 

TGGGACGGCTGGAGAACAGATGTTTTACGCATTATCTCGTTATGCAGATATTTCAGGATG

CAT AG CATTGACAAATG CT CAT ACAAAAGAAAAT GTTTT AT ATGAAGAGTTGTT AAAATT 

GTCGTGTOnTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACA^ 

GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCTATAATTGCATCATTGTATCTTGG

AATAATTGCAGCACCTGTTAGTGATAAATACATTGAACGTGAATTA^ 

TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTT^ 

AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 

AGGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTCCGATATTAA 

AAAATTTAAACCATATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTT^ 

TGGTACAACTGGTGTTTCGAAGGGAGTCATGCTAACTCACAAGAATA 

TTCTCATGCAAAAGATCCTACTTTTGGTAACGCAATTAATCCAAC^ 

GGTAATACCTTTCCACCATGGTTTTGGTATGATGACCACATTAGGATACTTTACTTGTGG 

ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGA

TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTTTTGCAAAAAGTGC 

ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTT 

ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGG 

GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAACAATGACCTCAGACC

GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG

AAAAATTTTGGGGCCAAATGAAACTGGAGAATTGTATTTTAAAGGCGACATGATAATGAA 

AGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTAACAAAGACGGATGGTTGCG 

CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA

GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT

ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACCGGATGAAGCCGCGGGCGA 

GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 

AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 

GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 

TGAAAAACACACCAATGGG * 
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FIGURE 24 



GGATCCAATGGAAGATAAAAATATTTTATATGGACCTGAACCATTTTATCCCTTGGCTGA
TGGGACGGCTGGAGAACAGATGTTTTACGCATTATCTCGTTATGCAGA TATTT CAGGATC 
CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 
GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 
GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCTATAATTGCATCATTGTATCTTGG 
AATAATTGCAGCACCTGTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTTGG 
TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTT^ 

AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 

AGGTTATCAATGCCTCAACAACITTATTTCTCAAAATTCCGATATTAATCTTGAC 

AAAATTTAAACCATATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTTTCTTC 

TGGTACAACTGGTGTTTCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGTACGAT^ 

TTCTCTTGX^AAAGATCCTACTTTTGGTAACGCAATTAAT 

GGTAATACCTTTCCACCATGGTT1TGGTATGATGACCACATTAGGATACTTTACT 

ATTCCX^GTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACA^ 

TTATAAAGTGGAAAGTACTTTACTTGTACGAACATTAAT 

ATTAGTTGAAAAGTACGATTTATCGCACTrAAAAGAAATTGCATCT 

ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCA^ 

GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAACAATGACGTCAGACC 

GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 

AAAAATTTTGGGGCCAAATGAAACTGGAGAATTGTATTTTAAAGGCGACATGATAATGAA 

AGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTACCAAAGACGGATGGTTGCG 

CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 

GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 

ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACCGGATGAAGCCGCGGGCGA 

GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 

AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 

GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 

TGAAAAACACACCAATGGG* 
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FIGURE 25 



GGATCCAATGGAAGATAAAAATATTTTATATGGACCTGAACCATTTTATCCCTTGGCTGA 

T GGG ACGGCTGG AG AACAGATGTTT T ACGCATT ATCT CGTT ATGCAGAT ATTT CAGGATG 

CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAA^ 

GTCGTGTCGTTTAGCXKSAAAGTTTTAAAAAGTATGGATTAAAACA^ 

GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCTATAATTGCATCATTGTATCTTGG 

AATAATTGCAGC^CCTGTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTrTGG 

TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTTTCAAAAAGTACTGAATGT

AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 

AGGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTCCGATATTAATCTTGACGTAAA

AAAATTTAAACCIATATTCrrTTAATCGAGACGATCAGGTTGCGTTGGTAATCj'rrrTCTTC 

TGGTACAACTGGTGTTTCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCACGATT 

TTCTATTGCAAAAGATCCTACTTTTGGTAACGCAATTAATCCAACGAC^GCAATT^ 

GGTAATACCTTTCCACCATGGTTTTGGTATGATGACCACATTAGGATACTTTACTTGTGG 

ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTAOUVTCATT 

TT AT AAAGTGGAAAGTACnTT ACTTCT ACCAACATTAATGG CATTTCTTGCAAAAAGTGC 

ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCT 

ATaUVAAGAAATTGGGGAGATGGTGAAAAAACtKrTTTAAATTAAACTT^ 

GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAACAATGACGTCAGACC 

GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 

AAAAATTTTGGGGCCAAATGAAACTGGAGAATTGTATTTTAAAGGCGACATGATAATGAA 

AGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTAACAAAGACGGATGGTTGCG 

CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 

GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 

ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACCGGATGAAGCCGCGGGCGA

GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 

AAATTTTGTTTCC^GTCAAGTTTCAACAGCC^ATGGCrACGTGGTGGGGTGAAArrTTT 

GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 

TGAAAAACACACCAATGGG * 
FIGURE 26



GGATCCAATGGAAGATAAAAATATTTTATATGGACCTGAACCATTTTATCCCTTGGCTGA 
TGGGACGGCTGGAGAACAGATGTTTGACGCATTATCTCGTTATGCAGATATTTCAGGATG 
CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 
GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 
GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCTATAATTGCATCATTGTATCTTGG 
AATAATTGCAGCACCTGTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTTGG 
TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTTTCAAAAAGTACTGAATGT 
AAAATCrAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 
AGGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTCCX^ 

AAAATTTAAACCATATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTTTCTTC 
TGGTACAACTGGTGTTTCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCACGATT 
TTCTCATGCAAAAGATCCTACTTTrGGTAACGCAATTAATCCAACGACAGCAATTTTAAC 
GGTAATACCTTTCCACCATGGTTTTGGTATGAtGACCACATTAGGATACTTTACTTGTGG
ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGA
TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTTTTGCAAAAAGTGC
ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTT
ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGG 
GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAACAATGACGTCAGACC 
GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 
AAAAATTTTGGGGCCAAATGAAACTGGAGAATTGTATTTTAAAGGCGACATGATAATGAA 
AGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTAACAAAGACGGATGGTTGCG 
CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 
GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT
ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACCGGATGAAGCCGCGGGCGA 
GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 
AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 
GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 
TGAAAAACACACCAATGGG*
FIGURE 27



DPMEDKNILYGPEPFYPLADGTAGEQMFYALSRYADISGCIALTNAHTKENVLYEEFLKL
SCRLAESFKKYGLKQNDTIAVCSENGLQFFXPIIASLYLGIIAAPVSDKYIERELIHSLG 
IVKPRIIFCSKNTFOKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK
KFKPYSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVARFSIAKDPTFGNAINPTTAILT 
VIPFHHGFGMOTTI/SYFTCGFRVVTjMHTFEEKLFLQSI^DYKVESTLLVPTUIAFTAKSA 
LVEKYDLSHUCEIASGGAPLSKEIGEMVKKRFKLNFSmOGYGLTETTSAVLITPNNDVRP 
GSTGKIVPFHAVKWDPTTGKILGPNETGELYFKGDMIMKGYYNNEEATKAIINKDGWLR
SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGE
LPAAGWVQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMF 
EKHTNG 
FIGURE 28



D PMEDKN I LYG PE P FY P LA DGT AGEQM FY ALSR YAO I SGCI ALTN AHTKENV LYEEI»LKL 
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPIIASLYLGIIAAPVSDKYIERELIHSLG
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK
KFKPYSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVARFSHAKDPTFGNAINPTTAILT 
VIPFHHGFGMMTTI/^YFTCGFRVVI^HTFEEKLFI^SUJDYKVESTLLVPTLMAFFA 
LVEKYDl^HUCEIASGGAPI^KEIGEMVKKRFKUJFVRQGYGLTETTSAVLITPNNDVRP 

GSTGKIVPFHAVKWDPTTGKILGPNETGELYFKGDMIMKGYYNNEEATKAIINKDGWLR

SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGE 

LPAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMF

EKHTNG 
FIGURE 29



DPMEDKNILYGPEPFYPLADGTAGEQMFYALSRYADISGCIALTNAHTKENVLYEEFLKL 
S CRLAES FKKYGLKQN DT I AVCS ENG LQ F FL P 1 1 AS L YLG I 1 AAP VSOK Y IERELIHSLG 
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISONSDINLDVK
KFKPYSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIWRFSLAKDPTFGNAINPTTAILT 
VIPFHHGFX^IMTTIXSYFTCGFRVVLMHTFEEKLFLQSl^DYKVESTLLVPTLMAFFAKSA 
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPNNDVRP 
GSTGKIVPFHAVKVVDPTTGKILGPNETGELYFKGDMIMKGYYNNEEATKAIITKDGWLR 
SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGE
LPAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMF

EKHTMG 
FIGURE 30



DPMEDKNILYGPEPFYPLADGTAGEQMFYALSRYADISGCIALTNAHTKENVLYEEFLKL
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPIIASLYLGIIAAPVSDKYIERELIHSLG
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK
KFKPYSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVARFSIAKDPTFGNAINPTTAILT
VIPFHHGFGMMTTLGYFTCGFRWLMHTPEEKLFLQSLQDYKVESTLLVPTLMAFLAKSA
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPNNDVRP 
GSTGKIVPFHAVKWDPTTGKILGPNETGELYFKGDMIMKGYYNNEEATKAIINKDGWLR 
SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGE 
LPAAGVWOTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMF
EKHTNG 
FIGURE 31



DPMEDKNILYGPEPFYPLADGTAGEQMFDALSRYADISGCIALTNAHTKENVLYEEFLKL
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPIIASLYLGIIAAPVSDKYIERELIHSLG 
IVKPRIIFCSKin , TOKVLNVKSKIJ<YVETIIILDLNEDLGGYOCLNNFISQNSDINLDVK 
KFKPTSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVARFSHAKDPTFGNAINPTTAILT 
V I PFHHGFGMMTTLGY FTCG FRWLMHT FEEKLFLQSLQDYKVESTLLVPTU4AFFAKS A 
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPNNDVRP 
GSTGKIVPFHAVKWDPTTGKILGPNETGELYFKGDMIMKGYYNNEEATKAIINKDGWLR 
SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLOHPYIVDAGVTGIPDEAAGE
LPAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMF
EKHTNG 
FIGURE 32



GGATCCAATGGCAGATAAAAATATTTTATATGGGCCCGAACCATTTTATCCCTTGGCTGA 

TGGGACGGCTGGAGAACAGATGTTTGACGCATTATCTCGTTATGCAGATATTTCAGGATG 

CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 

GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 

GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCGTAATTGCATCATTGTATCTTGGA 

ATAATTGCAGCACCTGTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTTGGT 

ATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTrTTCAAAAAGTACTGAATGTA 

AAATCTAAATTAAAATCTGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGGA 

GGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTCCGATATTAATCTTGACGTAAAA 

AAATTTAAACCATATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTTTCrrCT 

GGTACAACTGGTGTTTCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCACGATTT 

TCTCTTGCAAAAGATCCTACTTTTGGTAACGCAATTAATCCCACGACA^ 

GTAATACCTTTCCACCATGGTTTTGGTATGAtgACCACATTAGGATACTTTACTTGTGGA

TTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCT^ 

TATAAAGTGGAAAGTAC1TTACTTGTACCAACATTAATGGCATTTCTTGCAAAAAGTGCA 
TTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTTA 
TCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGGG 
TATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAAAxxxxxxGCCAGACCG 
GGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGGA
AAAATTTTGGGGCCAAATGAACCTGGAGAATTGTATTTTAAAGGCGCCATGATAATGAAG 
GGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTGATAATGACGGATGGTTGCGC 
TCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAAG 
TCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTTA 

CAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATTCCGGATGAAGCCGCGGGCGAG

CTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACAA 

GATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTTG 

GATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTTT 

GAAAAACACACCAATGGG* 
FIGURE 33



GGATCCAATGGCAGATAAAAATATTTTATATGGGCCCGAACCATTTTATCCCTTGGCTGA 

TGGGACGGCTGGAGAACAGATGTTTTACGCATTATCTCGTTATGCAGATATTTCAGGATG 

C AT AGCATTG ACAAATGCTCAT ACAAAAGAAAATGTTTT AT ATG AAG AGT T J 1 T T AAAATT 

GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 

GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCTCTAATTGCATCATTCT 

AATAATTGCAGCACCTGTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTTGG 

TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTTTCAAAAAGTACTGAATGT 

AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 

AGGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTCCGATATTAATCT T 

AAAATTTAAACCATATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTTTCTTC 

TGGTACAACTGGTGTTCCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCACGATT

TTCTCTTGCAAAAGATCCTACTTTTGGTAACGCAATTAATCCAACGACAGCAATTT^ 

GGTAATACCTTTCCACCATGGTTTTGGTATGATGACCACATTAGGATACTTTACTTGTGG 

ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGA 

TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTCTTGCAAAAAGTGC 

ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTT 

ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGG 

GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAAAxxxxxxGTCAGACC 

GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 

AAAAATTTTGGGGCCAAATGAACCTGGAGAATTGTATTTTAAAGGCGACATGATAATGAA 

AGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTGATAAAGACGGATGGTTGCG 

CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 

GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 

ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACCGGATGAAGCCGCGGGCGA

GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 

AAATTrrGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGGGGTGGGGTGAAATTTTT 

GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 

TGAAAAACACACCAATGGG * 
FIGURE 34



GGATCCAATGGCAGATAAAAATATTTTATATGGGOCCGAACCATTTTATCCCTTGGCTGA 

TGGGACGGCTGGAGAACAGATGTTTGACGCATTATCTCGTTATGCAGATATTCCCGGATG 

CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 

GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 

GGTGTGTAGCGAAAATGGTTTGCAATATTTCCTTCCTGTAATTGCATCA 

AATAATTGCAGCACCTGTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTTGG

TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTTTCAAAAAGTACTG 

AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 

AGGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTC^GATATTAATCTT GACGTAA A 

AAAATTTAAACCAAATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTTTCTTC

TGGTACAACTGGTGTTCCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTG CACC A^ 

TTCTATTGCAAAAGATCCT ACTTTTGGTAACGCAATTAAT CCAAOGACAGCAATTTTAAC 

GGTAATACCTTTCCACCATGGTrTTGGTATt^TGACCACATTAGGATACTTTACT^ 

ATTCXGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGA 

TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTCTTGCAAAAAGTGC 

ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTT 

ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGG 

GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAAAxxxxxxGCCAGACC

GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 

AAAAATTTTGGGGCCAAATGAACCTGGAGAATTGTATTTTAAAGGCGCCATGATAATGAA

GGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTGATAAAGACGGATGGTTGCG 

CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 

GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 

ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACCGGATGAAGCCGCGGGCGA 

GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 

AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 

GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 

TGAAAAACACACCAATGGG* 
FIGURE 35



GGATCCAATGGCAGATAAAA/VTATTTTATATGGGCCCX3AACCATTTTATCCCTTGGCTGA 

TGGGACGGCTGGAGAACAGATGTTTGACGCATTATCTCGTTATGCAGATATTCCCGGATG 

CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 

GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 

GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCTGTAATTGCATCATTGTATCTTGG 

AATAATTGCAGCACCTGTTAGTGATAAATACGTTGAACGTGAATTAATACACAGTCTTGG 

TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTTTCAAAAAGTACTGAATCT 

AAAATCTAAATTAAAATATGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 

AGGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTCCGATAGTAATCTGGACGTAAA 

AAAATTTAAACCAAATTCTTTTAATCX»AGACGATCAGGTTGCGTTGGTAATG ^ 

TGGTACAACTGGTGTTCCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCACGATT 

TTCTCTTGCAAAAGATCCTACTTTTGGTAACGCAATTAATCCAACGACAGCAATTTTAAC 

GGTAATACCTTTCCACCATGGTTTTGGTATGATGACCACATTAGGATACTTTACTTGTGG 

ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGA 

TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTCTTGCAAAAAGTGC 

ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTT 

ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGG 

GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAAAxxxxxxGCCAGACC 

GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 

AAAAATTTTGGGGCCAAATGAACCTGGAGAATTGTATTTTA7VAGGCGCCATGATAATGAA 

GGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTGATAAAGACGGATGGTTGCG 

CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 

GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 

ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACCGGATGAAGCCGCGGGCGA 

GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 

AAATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 

GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 

TGAAAAACACACCAATGGG*
FIGURE 36



DPMADKNILYGPEPFYPLADGTAGEQMFBALSRYADISGCIALTNAHTKENVLYEEFLKL
SCRIAESFKKYGLKQNDTIAVCSENGLQFFLPVIASLYLGIIAAPVSDKYIERELIHSLG 
I VKPRI I FCSKNTPQKVLNVKSKLKSVET 1 1 ILDLNEDLGGYQCLNNFISQNSDINLDVK 
KFKPYSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVARFSIAKDPTFGNAINPTTAILT
VIPFHHGFGMMTTLGYFTCGFRWLMHTFEEKLFLQSLQDYKVESTLLVPTLMAFIAKSA
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPKxxARPG
STGKIVPFHAVKVVDPTTGKILGPNEPGELYFKGAMIMKGYYNNEEATKAIIDNDGWLRS 
GDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGEL
PAAGVWQTGKYLNEQIVQDFVSSOVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMFE 

KHTNG$ 
FIGURE 37



DPMADKNILYGPEPFYPLADGTAGEOMFYALSRYADISGCIALTNAHTKENVLYEEFLKL
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPVIASLYLGIIAAPVSDKYIERELIHSLG
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK
KFKPYSFNRDDQVALVMFSSGTTGVPKGVHLTHKNIVARFSLAKDPTFGNAINPTTAILT
VIPFHHGFGMMTTLGYFTCGFRWLMHTFEEKLFLQSLQDYKVESTLLVPTLMAFLAKSA
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPKxxVRPG
STGKIVPFHAVKWDPTTGKILGPNEPGELYFKGDMIMKGYYNNEEATKAIIDKDGWLRS
GDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGEL
PAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMFE
KHTNG 
FIGURE 38



DPMADKNILYGPEPFYPLADGTAGEQMFDALSRYADIPGCIALTNAHTKENVLYEEFLKL
SCRIAESFKKYGLKQNDTIAVCSENGLQYFLPVIASLYLGIIAAPVSDKYIERELIHSIiG 
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYOCLNNFISONSDINLDVK 
K FKPNS FNR DDQVALVM FS SGTTGV PKGVMLTHKN I VARFS IAKDPT FGNAIN PTTAI I*T 
VIPFHHGFGMMTTLGYFTCGFRWLMHTFEEKLFLOSLQDYKVESTLLVPTLMAFLAKSA
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPKxxARPG
STGKIVPFHA\nCVVDPTTGKILGPNEPGELYFKGAMIMKGYYNNEEATKAI IOKDGWLRS 
GDIAYYDNDGHFYIVDRLKSLIKYKGYOVAPAEIEGILLOHPYIVDAGVTGIPDEAAGEL 
PAAGVVVQTGKYLNEOIVQNFVSSQVSTAKWLRGGVKEXDEI PKGSTGKI DRKVLRQMFE 
KHTNG 
FIGURE 39



DPMADKNILYGPEPFYPLADGTAGEQMFDALSRYADIPGCIALTNAHTKENVLYEEFLKL

SCRI-AESFKKYGLKQNDTIAVCSENGLQFFLPVIASLYLGIIAAPVSDKYVERELIHSLG 

I VKPRI I FCSKNTFQKVLNVKSKLKYVETI 1 1 LDLNEDLGGYQCLNNFI SQNSDSNLDVK 

KFKPNSFNRDDCVALVMFSSGTTGVPKGVMLTHKNIVARFS1JUCDPTFGNAINPTTAILT 

VIPFHHGFGMMTTLGYFTCGFRWLMHTFEEKLFLQSLQDYKVESTLLVPTLMAFIAKSA

LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPKxxARPG 

STGKIVPFHAVKVVDPTTGKIIX3PNEPGELYFKGAMIMKGYYNNEEATKAIIDKDGWLRS 

GDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGEL 

PAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMFE

KHTNG 
FIGURE 40



GGATCCAATGGCAGATAAAAATATTTTATATGGGCCCGAACCATTTTATCCCTTGGCTGA 
TGGGACGCCTGGAGAACAGATGTTTGACGCATTATCTCGTTATGCAGATATTCCGGCCTG 
CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTTTAAAATT 
GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 
GGTGTGTAGCGAAAATGGTTTGCAATTTTTCCTTCCTGTAATTGCATCATTGTATCTTGG 
AATAATTGTGGCACCTGTTAACGATAAATACATTGAACGTGAATTAATACACAGTCTTGG 
TATTGTAAAACCACCCATA G ^ riTTl w r GCTCCAAGAATACTTTTCAAAAAGTACTGAATGT 
AAAATCTAAATTAAAATCTGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 
AGGTTATCAATGCCTCAACAACTTTATTTCTCAAAATTCCGATATTAATCTTGACGTAAA
AAAATTTAAACCATATTCrTrrAATCGAGACGATCAGGTTGCGTTGATTATGTTTTCTTC 
TGGTACAACTGGTCTGCCGAAG<^AGTCATGCTAACTCACAAGAATATTGTTGCACGATT 
TTCTCTTGCAAAAGATCCTACTTTTGGTAACGCAATTAATCCCACGACAGCAATTTTAAC 
GGTAATACCTTTCCACCATGGTTTTGGTATGATGACCACATTAGGATACTTTACTTGTGG 
ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTACAATCATTACAAGA 
TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTCTTGCAAAAAGTGC 
ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTT 
ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGG 
GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAAAxxxJcxxGCCAGACC 
GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 
AAAAATTTTGGGGCCAAATGAACCTGGAGAATTGTATTTTAAAGGCCCGATGATAATGAA 
GGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTGATAATGACGGATGGTTGCG
CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 
GTCATTAATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 
ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATTCCGGATGAAGCCGCGGGCGA 
GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 
AGATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 
GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 
TGAAAAACACACCAATGGG 
FIGURE 41



DPMADKNILYGPEPFYPLADGTAGEQMFDALSRYADIPGCIALTNAHTKENVLYEEFLKL 
SCRLAESFKKYGLKQNDTIAVCSENGLQFFLPVIASLYLGIIVAPVNDKYIERELIHSLG
IVKPRIVFCSKNTFQKVLNVKSKLKSVETIIILDLNEDLGGYQCLNNFISQNSDINLOVK
KFKPYSFNRDDQVALIMFSSGTTGLPKGVMLTHKNIVARFS1AKDPTFGNAINPTTAILT 
VIPFHHGFGMMTTLGYFTCGFRWLMHTFEEKLFLOSLQDYKVESTLLVPTLMAFLAKSA
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPKxxARPG
STGKIVPFHAVKWDPTTGKILGPNEPGELYFKGPMIMKGYYNNEEATKAIIDNDGWLRS
GDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGEL 
PAAGWVQTGKYLNEQIVQDFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMFE
KHTNG 
FIGURE 42



GGATCCAATGGCAGATAAGAATATTTTATATGGGCCCGAACCATTTTATCCCTTGGAAGA

TGGGACGGCTGGAGAACAGATGTTTGA^CATTATCTCGTTATGCAGATATTCCGGGCTG 

CATAGCATTGACAAATGCTCATACAAAAGAAAATGTTTTATATGAAGAGTTTCTGAAACT 

GTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAACAAAACGACACAATAGC 

GGTGTGTAGCGAAAATGGTCTGCAATTTTTCCTTCCTGTAATTGCATCATTGTATCTTGG

AATAATTGTGGCACCTGTTAACGATAAATACATTGAACGTGAATTAATACACA 

TATTGTAAAACCACGCATAATTTTTTGCTCCAAGAATACTTTTCAAAAAGTACTGAATGT 

AAAATCTAAATTAAAATCTGTAGAAACTATTATTATATTAGACTTAAATGAAGACTTAGG 

AGGTrATCAATGCCTCAACAACTTTATTTCTCAAAATTC 

AAAATTTAAACCATATTCTTTTAATCGAGACGATCAGGTTGCGTTGTTAATGT^ 

TGGTACAACTGGTCTGCCGAAGGGAGTCATGCTAACTCACAAGAATATTGTTGCACGATT 

TTCTCTTGCaAAAGATCCTACTTTTGGTAACGCAATTAATCCCACGACAGCAATTTTAAC

GGTAATACCTTTCCACCATGGTTTTGGTATGATGACCACATTAGGATACTTTACTTGTGG 

ATTCCGAGTTGTTCTAATGCACACGTTTGAAGAAAAACTATTTCTA 

TTATAAAGTGGAAAGTACTTTACTTGTACCAACATTAATGGCATTTCTTG 

ATTAGTTGAAAAGTACGATTTATCGCACTTAAAAGAAATTGCATCTGGTGGCGCACCTTT 

ATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGTCAGGCAAGG 

GTATGGATTAACAGAAACCACTTCGGCTGTTTTAATTACACCGAAAxxxxxxGCCAAACC 

GGGATCAACTGGTAAAATAGTACCATTTCACGCTGTTAAAGTTGTCGATCCTACAACAGG 

AAAAATTTTGGGGCCAAATGAACCTGGAGAATTGTATTTTAAAGGCCCGATGATAATGAA 

GGGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTGATAATGACGGATGGTTGCG 

CTCTGGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAA 

GTCACTGATTAAATATAAAGGTTATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTT 

ACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATTCCGGATGAAGCCGCGGGCGA 

GCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACA 

AGATTATGTTGCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTT 

GGATGAAATTCCCAAAGGATCAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTT 

TGAAAAACACACCAATGGG
FIGURE 43



DPMADKNILYGPEPFYPLEDGTAGEQMFDALSRYADIPGCIALTNAHTKENVLYEEFLKL
SCRIJ^SFKKYGLKQNDTIAVCSENGLQFFLPVIASLYI^irVAPVHDKYIERELIHSLG 
IVKPRIIFCSKNTFQKVLNVKSKLKSVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK 
KFKPYSFNRDDQVAIXMFSSGTTGLPKGVMLTHKNIVARFSLAKDPTFGNAINPTTAILT 
VIPFHHGPGMMTTLGYFTCGFRWLMHTFEEKLFLQSLQDYKVESTLLVPTLMAFIAKSA
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPKxxAKPG 
STGKIVPFHAVKWDPTTGKILGPNEPGELYFKGPMIMKGYYNNEEATKAIIDNDGWLRS
GDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGEL
PAAGVWQTGKYLNEQIVQDYVASQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMFE
KHTNG 
FIGURE 44



(K3ATCCAATGGAAGATAAAAATATTTTATATGGACCT 

ATGTTTTACGCATTATCTCGTTATGCAGATATTTCAGGATGCATAGCATTGACAAATGCTCATACAAAAGAAAATGTTT 

TATATGAAGAGTTTTTAAAATTGTCGTGTCGTTTAGCGGAAAGTTTTAAAAAGTATGGATTAAAAC^ 

AGCGGTGTGTAGCGAAAATGGTTTGCAA1TTTTCCTTCCTTTAATTGCATCATTCT 

GTTAGTGATAAATACATTGAACGTGAATTAATACACAGTCTTGGTATTGTAAAAC^ 

ATACTTTTCAAAAAGTACTGAATGTAAAATCTAAATTAAAATATGTAGAAACTATTA 

CTTAGGAGGTTATCAATGCCTCAACAACTTTATTTCT 

AATTCTTTTAATCGAGACGATCAGGTTGCGTTGGTAATGTTTTCTTCTGGTACAACT 
TAACTCACAAGAATATTGTTGC^CGATTTTCTCATTGCAAAGATCCT^ 

AATTTTAACGGTAATACCTTTCCACCATGGTTTTGGTATGATGACCACATTAGGATACTTTACT 
GCTCTAATGCACACGTTTGAAGAAAAACTATTT^ 

CAACATTAATGGCATTTTTTGCAAAAAGTGCATTAGTTGAAAAGTACGATTTA 
TGGCGCACCTTTATCAAAAGAAATTGGGGAGATGGTGAAAAAACGGTTTAAATTAAACTTTGT 

ttaacagaaaccactto;gctgttttaattac^^ 

TTCACGCTGTTAAAGTTGTCGATCCTACAACAGGAAAAATm 

CGACATGATAATGAAAAGTTATTATAATAATGAAGAAGCTACTAAAGCAATTATTAACAAAGACGGATGCT 

GGTGATATTGCTTATTATGACAATGATGGCCATTTTTATATTGTGGACAGGCTGAAGTCATTAATTAAAT 

ATCAGGTTGCACCTGCTGAAATTGAGGGAATACTCTTACAACATCCGTATATTGTTGATGCCGGCGTTACTGGTATACC 

GGATGAAGCCGCGGGCGAGCTTCCAGCTGCAGGTGTTGTAGTACAGACTGGAAAATATCTAAACGAACAAATCGTACAA 

AATTTTGTTTCCAGTCAAGTTTCAACAGCCAAATGGCTACGTGGTGGGGTGAAATTTTTGGATGAAATTCCCAAAGGAT 

CAACTGGAAAAATTGACAGAAAAGTGTTAAGACAAATGTTTGAAAAACACAAATCTAAGCTG 
FIGURE 45



DPMEDKNILYGPEPFYPLADGTAGEQMFYALSRYADISGCIALTNAHTKENVLYEEFLKL
SCRIJ^SFKKYGLKQNDTIAVCSENGLQFFLPLIASLYLGIIAAPVSDKYIERELIHSLG 
IVKPRIIFCSKNTFQKVLNVKSKLKYVETIIILDLNEDLGGYQCLNNFISQNSDINLDVK
KFKPNSFNRDDQVALVMFSSGTTGVSKGVMLTHKNIVARFSHCKDPTFGNAINPTTAILT
VIPFHHGEtMbfTTIXjYFrCGFRVAl^HTFEEKLFLQSWDYKVESTLLVPTIllAFFAKSA 
LVEKYDLSHLKEIASGGAPLSKEIGEMVKKRFKLNFVRQGYGLTETTSAVLITPDTDVRP
GSTGKIVPFHAVKVVDPTTGKILGPNETGELYFKGDMIMKSYYNNEEATKAIINKIXJWLR 
SGDIAYYDNDGHFYIVDRLKSLIKYKGYQVAPAEIEGILLQHPYIVDAGVTGIPDEAAGE
LPAAGVWQTGKYLNEQIVQNFVSSQVSTAKWLRGGVKFLDEIPKGSTGKIDRKVLRQMF
EKHKSKL 
FIGURE 46



DPMMKREKNVIYGPEPLHPLEDLTAGEMLFRALRKHSHLPQALVDWGDESLSYKEFFEA 
TVLLAOSLHNCGYKMNDVVSICAENNTRFFIPVIAAWYIGMIVAPVNESYIPDELCKVMG 
ISKPQIVFTTKNILNKVLEVQSRTNFIKRIIILDTVENIHGCESLPNFISRYSDGNIANF 
KPLHFDPVEQVAAILCSSGTTGLPKGVMQTHQNICVRLIHALDPRAGTQLIPGVTVLVYL 
PFFHAFGFSITLGYFMVGLRVIMFRRFDQEAFLKAIQDYEVRSVINVPSVILFLSKSPLV 
DKYDLSSLRELCCGAAPLAKEVAEVAAKRLNLPGIRCGFGLTESTSANIHSLRDEFKSGS 
LGRVTPLMAAKIADRETGKALGPNQVGELCIKGPMVSKGYVNNVEATKEAIDDDGWLHSG 
DFGYYDEDEHFYWDRYKELIKYKGSQVAPAELEEILLKNPCIRDVAWGIPDLEAGELP 
SAFWKQPGKEITAKEVYDYLAERVSHTKYLRGGVRFVDSIPRNVTGKITRKELLKQLLE 
KAGG 
FIGURE 47



GGATCCCATGATGAAGCGAGAGAAAAATGTTATATATGGACCCGAACCCCTACACCCCTT 
GGAAGACTTAACAGCTGGAGAAATGCTCTTCCGTGCCCTTCGAAAACATTCTCATTTACC 
GCAGGCTTTAGTAGATGTGGTTGGCGACGAATCGCTTTCCTATAAAGAGTTTTTTGAAGC 
GACAGTCCTCCTAGCGCAAAGTCTCCACAATTGTGGATACAAGATGAATGATGTAGTGTC 
GATCTGCGCCGAGAATAATACAAGATTTTTTATTCCCCTTATTGCAGCTTGGTATATTGG 
TATGATTGTAGCACCTGTTAATGAAAGTTACATCCCAGATGAACTCTGTAAGGTGATGGG 
TATATCGAAACCACAAATAGTTTTTACGACAAAGAACATTTTAAATAAGGTATTGGAGGT 
ACAGAGCAGAACTAATTTCATAAAAAGGATCATCATACTTGATACTGTAGAAAACATACA 
CGGTTGTGAAAGTCTTCCCAATTTTATTTCTCGTTATTCGGATGGAAATATTGCCAACTT 
CAAACCTTTACATTTCGATCCTGTTGAGCAAGTGGCAGCTATCTTATGTTCGTCAGGCAC 
TACTGGATTACCGAAAGGTGTAATGCAAACTCACCAAAATATTTGTGTCCGACTTATACA 
TGCTTTAGACCCCAGGGCAGGAACGCAACTTATTCCTGGTGTGACAGTCTTAGTATATCT 
GCCTTTTTTCCATGCTTTTGGGTTCTCTATAACCTTGGGATACTTCATGGTGGGTCTTCG 
TGTTATCATGTTCAGACGATTTGATCAAGAAGCATTTCTAAAAGCTATTCAGGATTATGA 
AGTTCGAAGTGTAATTAACGTTCCATCAGTAATATTGTTCTTATCGAAAAGTCCTTTGGT 
TGACAAATACGATTTATCAAGTTTAAGGGAATTGTGTTGCGGTGCGGCACCATTAGCAAA 
AGAAGTTGCTGAGGTTGCAGCAAAACGATTAAACTTGCCAGGAATTCGCTGTGGATTTGG 
TTTGACAGAATCTACTTCAGCTAATATACACAGTCTTAGGGATGAATTTAAATCAGGATC
ACTTGGAAGAGTTACTCCTTTAATGGCAGCTAAAATAGCAGATAGGGAAACTGGTAAAGC 
ATTGGGACCAAATCAAGTTGGTGAATTATGCATTAAAGGTCCCATGGTATCGAAAGGTTA 
CGTGAACAATGTAGAAGCTACCAAAGAAGCTATTGATGATGATGGTTGGCTTCACTCTGG 
AGACTTTGGATACTATGATGAGGATGAGCATTTCTATGTGGTGGACCGTTACAAGGAATT 
GATTAAATATAAGGGCTCTCAGGTAGCACCTGCAGAACTAGAAGAGATTTTATTGAAAAA 
TCCATGTATCAGAGATGTTGCTGTGGTTGGTATTCCTGATCTAGAAGCTGGAGAACTGCC 
ATCTGCGTTTGTGGTTAAACAGCCCGGAAAGGAGATTACAGCTAAAGAAGTGTACGATTA 
TCTTGCCGAGAGGGTCTCCCATACAAAGTATTTGCGTGGAGGGGTTCGATTCGTTGATAG 
CATACCAAGGAATGTTACAGGTAAAATTACAAGAAAGGAACTTCTGAAGCAGTTGCTGGA 
GAAGGCGGGAGGT 